Databases and Analytics Flashcards

1
Q

RDS is for Online Transaction Processing (OLTP) workloads that _____ lots of _____ transactions

A

RDS is for Online Transaction Processing (OLTP) workloads that process lots of small transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Online Transaction Processing (OLTP) workloads use cases would be:

_______ orders

_______ transactions

Payments

_______ systems

A

Online Transaction Processing (OLTP) workloads use cases would be

Customer orders

Banking transactions

Payments,

Booking systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Redshift is a ________ database that is used for Online ________ Processing (OLAP) or data ___________

A

Redshift is a relational database that is used for Online Analytics Processing (OLAP) or data warehousing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Online Analytics Processing (OLAP) is ideal for tasks like analyzing large amounts of data _________ and sales ___________

A

Online Analytics Processing (OLAP) is ideal for tasks like analyzing large amounts of data reporting and sales forecasting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Amazon RDS Read Replicas enable you to create one or more ____-____ _______ of your _______ instance within the _____ AWS Region or in a _________ AWS Region

A

Amazon RDS Read Replicas enable you to create one or more read-only copies of your database instance within the same AWS Region or in a different AWS Region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Read replicas are primarily used for _______ and improving performance not ________ ________

A

Read replicas are primarily used for scaling and improving performance not disaster recovery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Automatic ______ must be enabled in order to deploy a read ______

A

Automatic backups must be enabled in order to deploy a read replica

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Multi-AZ in regards to databases is used only for disaster recovery. In the event of a failure, RDS will automatically ________ to the _______ instance

A

Multi-AZ in regards to databases is used only for disaster recovery. In the event of a failure, RDS will automatically failover to the standby instance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Scenario question: Your database is bottlenecking how can you get around it?

A

Create a Read Replicas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Aurora is Amazon’s proprietary database that is compatible with ______, as well as __________.

A

Aurora is Amazon’s proprietary database that is compatible with MySQL, as well as PostgreSQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Amazon Aurora is very redundant. You always have __ copies of your data in each ___ within a minimum of __ AZ’s. Giving you a total of __ copies

A

Amazon Aurora is very redundant. You always have #2 copies of your data in each AZ with a minimum of #3 AZ’s. Giving you a total of #6 copies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

You can take ________ with Auroras and share them with other AWS ________

A

You can take snapshots with Auroras and share them with other AWS accounts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

DynamoDB is spread across #__ geographically ______ data centers to ensure _________

A

DynamoDB is spread across 3 geographically distinct data centers to ensure resiliency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

With DynamoDB you get eventually ________ reads by default

A

With DynamoDB you get eventually consistent reads by default

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the three read options you get with DynamoDB?

A

eventual consistency,
strong consistency,
transactional.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The acronym ACID refers to the four key properties of a transaction:

A

Atomicity,
Consistency,
Isolation,
Durability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Scenario questions that mention ACID requirements should make you think of _________ transactions

A

Scenario questions that mention ACID requirements should make you think of DynamoDB transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What effect does DynamoDB On-Demand Backup have on the performance or availability of your tables?

____ impact on performance while offering ____ backups

A

Zero impact on performance while offering full backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

In which region is DynamoDB On-Demand Backups retained?

A

Same region as the source table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

DynamoDB Point-in-Time Recovery protects against accidental ______ or _______

A

DynamoDB Point-in-Time Recovery protects against accidental writes or deletes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

When using DynamoDB Point-in-Time Recovery what is the last restorable point in the past?

A

5 minutes in the past.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Streams are time-ordered sequences of ____ level changes in a ____.

A

Streams are time-ordered sequences of item level changes in a table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Every shard in a stream is stored for how long?

A

24 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

A Database on an EC2 is ideal if you need (2)

____ _______ over instance and database or have a _____-_____ database engine (not avail in RDS)

A

full control over instance and database or have a third-party database engine (not avail in RDS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Amazon RDS is ideal if you have data that is well-**\_\_\_\_\_\_** and **\_\_\_\_\_\_\_\_**
Amazon RDS is ideal if you have data that is well-**formed** and **structured**
26
DynamoDB 4 main features:
NoSQL High I/O needs Dynamic Scaling In-memory performance
27
Data Warehouse for large volumes of aggregated data
Amazon Redshift
28
Amazon Elasticache is fast **\_\_\_\_\_\_\_\_\_** storage for **\_\_\_\_\_** amounts of data. Amazon Elasticache is an in-**\_\_\_\_\_\_\_** store.
Amazon Elasticache is fast **temporary** storage for **small** amounts of data. Amazon Elasticache is an in-**memory** store.
29
Amazon EMR is an **\_\_\_\_\_\_\_\_** workload using the **\_\_\_\_\_\_\_\_** framework
Amazon EMR is an **Analytics** workload using the **Hadoop** framework
30
What are the 6 Database engines that Amazon RDS supports? **\_\_\_\_\_** Server **\_\_\_\_\_\_** SQL **\_\_\_** Server \_\_\_\_\_ \_\_\_\_\_ \_\_\_\_\_\_
MySQL Server PostgreSQL SQL Server Aurora Oracle MariaDB
31
Amazon RDS has a **\_\_\_\_\_\_** maintenance schedule by default but you can choose your own. OS & DB **\_\_\_\_\_\_\_\_** is what happens during the weekly maintenance window
Amazon RDS has a w**eekly** maintenance schedule by default but you can choose your own. OS & DB **patching** is what happens during the weekly maintenance window
32
When is the only time you can enable encryption on an AWS RDS DB instance?
When you create it DBs that has encryption enabled cannot be disable it
33
You cannot have an **\_\_\_\_\_\_\_\_** read replica of an **\_\_\_\_\_\_\_\_\_** DB instance
You cannot have an **encrypted** read replica of an **unencrypted** DB instance
34
You cannot restore an unencrypted **\_\_\_\_\_\_\_** or **\_\_\_\_\_\_\_** to an encrypted **\_\_** instance
You cannot restore an unencrypted **backup** or **snapshot** to an encrypted **DB** instance
35
Read replicas of encrypted primary instances are \_\_\_\_\_\_\_\_
Read replicas of encrypted primary instances are **encrypted**
36
Amazon Aurora is up to five times faster than standard _____ databases and three times faster than standard _________ databases
Amazon Aurora is up to five times faster than standard **MySQL** databases and three times faster than standard **PostgresSQL** databases
37
Aurora Serverless Use Cases (5) New or **\_\_\_\_\_\_\_\_\_\_\_** used applications * *\_\_\_\_\_**-tenant applicatons * *\_\_\_\_\_\_\_\_\_\_\_** Workloads * *\_\_\_\_\_\_\_** Workloads * *\_\_\_\_\_\_\_\_\_\_\_** and test databases
New or **Infrequently** used applications * *Multi**-tenant applicatons * *Unpredictable** Workloads * *Variable** Workloads * *Development** and **test** databases
38
Amazon ElastiCache is a fully managed caching service for _____ and \_\_\_\_\_\_\_\_\_
Amazon ElastiCache is a fully managed caching service for **Redis** and **Memcached**
39
Amazon ElastiCache is a \_\_\_\_/\_\_\_\_ store
Amazon ElastiCache is a **key/value** store
40
Amazon ElastiCache can be put in front of databases such as ____ and \_\_\_\_\_\_\_\_\_
Amazon ElastiCache can be put in front of databases such as **RDS** and **DynamoDB**
41
ElastiCache nodes run on ___ \_\_\_\_\_\_\_\_\_ so you must choose an ________ \_\_\_\_\_\_ type
ElastiCache nodes run on **EC2 instances** so you must choose an **instance family** type
42
Between Memcached and Redis, which offers Data Persistence?
Between Memcached and Redis, which offers Data Persistence? Redis offers Data Persistence Memcached does not
43
Memcached will place nodes in Multi-AZs but you will not get ________ or \_\_\_\_\_\_\_\_\_
Memcached will place nodes in Multi-AZs but you will not get **failover,** or **replication**
44
You can use ElastiCache for caching, which accelerates ___________ and ________ performance
You can use ElastiCache for caching, which accelerates **application** and **database** performance
45
Amazon ElastiCache can also be a primary data store for use cases that don't require durability like gaming \_\_\_\_\_\_\_\_, \_\_\_\_\_\_\_\_, and \_\_\_\_\_\_\_
Amazon ElastiCache can also be a primary data store for use cases that don't require durability like gaming **leaderboards**, **streaming, and analytics.**
46
You can restore your DynamoDB database backup to any point in the last **\_\_\_** days, the backups are **\_\_\_\_\_\_\_\_\_\_**. This feature is not **\_\_\_\_\_\_\_\_\_** by \_\_\_\_\_\_\_\_\_.
At any point in the last **35** days, the backups are **incremental**. Not **enabled** by **default**.
47
DynamoDB Accelerator is an **\_\_-\_\_\_\_\_\_\_** cache that increases **\_\_\_\_\_\_\_\_\_\_** (microsecond latency)
The acronym DAX represents DynamoDB Accelerator which is an **In-memory** cache that increases **performance** (microsecond latency)
48
What type of backups and copies of your data does Redshift offer? Redshift offers continuous **\_\_\_\_\_\_\_\_\_** backups And always keeps **\_\_\_\_\_\_** copies of your data
continuous **incremental** backups Always keeps **three** copies of your data
49
Amazon EMR can be used for ___________ and ______ large amounts of \_\_\_\_
Amazon EMR can be used for **transforming** and **moving** large amounts of **data**
50
Kinesis Data Streams enables real-time processing of ________ \_\_\_ \_\_\_\_
Kinesis Data Streams enables real-time processing of **streaming big data**
51
The Kinesis Client Library helps you ______ and ______ data from a Kinesis data stream
The Kinesis Client Library helps you **consume** and **process** data from a Kinesis data stream
52
Kinesis Data Firehose \_\_\_\_\_, _____ and loads streaming data
Kinesis Data Firehose **Captures, Transforms**, and loads streaming data
53
With Kinesis Data Firehose there are no \_\_\_\_, everything is \_\_\_\_\_\_
With Kinesis Data Firehose there are no **Shards**, everything is **automated**
54
Between Memcached and Redis, which offers encryption?
Between Memcached and Redis, which offers encryption? Redis offers encryption Memcached does not
55
Kinesis Data Firehose enables ____ real-time _______ with existing business intelligence tools and dashboards
Kinesis Data Firehose enables **near** real-time **analytics** with existing business intelligence tools and dashboards
56
Kinesis Data Firehose possible destinations: S3 \_\_\_\_\_ Data\_\_\_ HTTP \_\_\_\_\_\_ Mongo\_\_\_
S3 Splunk Datadog HTTP **Endpoint** Mongo**DB**
57
Kinesis Data Analytics provides \_\_\_\_-\_\_\_\_ SQL processing for streaming data
Kinesis Data Analytics provides **real-time** SQL processing for streaming data
58
Kinesis Data Analytics destination can be Kinesis \_\_\_\_\_\_\_\_ Kinesis \_\_\_\_\_\_\_\_\_ Lambda
Kinesis Data **Streams** Kinesis Data **Firehose** Lambda
59
Amazon Athena is used for _______ \_\_\_\_ in S3 using SQL
Amazon Athena is used for **querying data** in S3 using SQL
60
You would connect Amazon Athena to data sources other than S3 by using \_\_\_\_\_.
You would connect Amazon Athena to data sources other than S3 by using **Lambda**
61
The methods that can be used to optimize Amazon Athena include: \_\_\_\_\_\_\_ you data \_\_\_\_\_\_ your data \_\_\_\_\_\_\_\_ your data
* *Partition** you data * *Bucket** your data **Compress** your data
62
AWS Glue is a fully managed extract, \_\_\_\_\_\_\_, and ____ service that is used for preparing data for analytics.
AWS Glue is a fully managed extract, **transform** and **load** service that is used for preparing data for analytics.
63
AWS Glue discovers data and stores the associated metadata in the AWS ____ \_\_\_\_ Catalog
AWS Glue discovers data and stores the associated metadata in the AWS **Glue Data** Catalog
64
You can use a _______ to populate the AWS Glue Data Catalog with \_\_\_\_\_\_\_
You can use a **crawler** to populate the AWS Glue Data Catalog with **tables**
65
Between Memcached and Redis, which offers Multithreading?
Between Memcached and Redis, which offers Multithreading? Redis offers Multithreading Memcached does not
66
A crawler can crawl multiple data stores in a ______ \_\_\_
A crawler can crawl multiple data stores in a **single run**
67
A real-time solution to process or move data is called \_\_\_\_\_.
A real-time solution to process or move data is called **Kinesis**
68
Simple Queue Service (SQS) and Kinesis can both be queues, but each has its pros and cons: SQS is _____ and \_\_\_\_\_ Kinesis is _____ & can store data for up to a \_\_\_\_
SQS is **easier** and **simpler** Kinesis is **faster** & can store data for up to a **year**
69
Anytime Serverless SQL comes up on the test think of _______ \_\_\_\_\_\_\_.
Anytime Serverless SQL comes up on the test think of **Amazon Athena**
70
Quicksight is a service that is used for _______ the data in a dashboard or \_\_\_\_\_
Quicksight is a service that is used for **visualizing** the data in a dashboard or **graph**
71
The acronym DAX represents __________ \_\_\_\_\_\_\_\_\_\_\_
The acronym DAX represents DynamoDB Accelerator
72
An Aurora global database consists of **\_\_\_** primary AWS Region where your data is mastered and up to **\_\_\_\_** read-only, **\_\_\_\_\_\_\_\_\_** AWS Regions
An Aurora global database consists of **one** primary AWS Region where your data is mastered and up to **five** read-only, **secondary** AWS Regions
73
Amazon Athena is an interactive **\_\_\_\_\_** service that makes it easy to **\_\_\_\_\_\_\_** data in Amazon **\_\_** using standard SQL.
Amazon Athena is an interactive **query** service that makes it easy to **analyze** data in Amazon S3 using standard **SQL**.
74
Athena is serverless, so there is no infrastructure to manage, and you pay only for the **\_\_\_\_\_\_** that you run.
Athena is serverless, so there is no infrastructure to manage, and you pay only for the **queries** that you run.