Databases and Analytics Flashcards

1
Q

RDS is for Online Transaction Processing (OLTP) workloads that _____ lots of _____ transactions

A

RDS is for Online Transaction Processing (OLTP) workloads that process lots of small transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Online Transaction Processing (OLTP) workloads use cases would be:

_______ orders

_______ transactions

Payments

_______ systems

A

Online Transaction Processing (OLTP) workloads use cases would be

Customer orders

Banking transactions

Payments,

Booking systems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Redshift is a ________ database that is used for Online ________ Processing (OLAP) or data ___________

A

Redshift is a relational database that is used for Online Analytics Processing (OLAP) or data warehousing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Online Analytics Processing (OLAP) is ideal for tasks like analyzing large amounts of data _________ and sales ___________

A

Online Analytics Processing (OLAP) is ideal for tasks like analyzing large amounts of data reporting and sales forecasting

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Amazon RDS Read Replicas enable you to create one or more ____-____ _______ of your _______ instance within the _____ AWS Region or in a _________ AWS Region

A

Amazon RDS Read Replicas enable you to create one or more read-only copies of your database instance within the same AWS Region or in a different AWS Region

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Read replicas are primarily used for _______ and improving performance not ________ ________

A

Read replicas are primarily used for scaling and improving performance not disaster recovery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Automatic ______ must be enabled in order to deploy a read ______

A

Automatic backups must be enabled in order to deploy a read replica

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Multi-AZ in regards to databases is used only for disaster recovery. In the event of a failure, RDS will automatically ________ to the _______ instance

A

Multi-AZ in regards to databases is used only for disaster recovery. In the event of a failure, RDS will automatically failover to the standby instance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Scenario question: Your database is bottlenecking how can you get around it?

A

Create a Read Replicas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Aurora is Amazon’s proprietary database that is compatible with ______, as well as __________.

A

Aurora is Amazon’s proprietary database that is compatible with MySQL, as well as PostgreSQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Amazon Aurora is very redundant. You always have __ copies of your data in each ___ within a minimum of __ AZ’s. Giving you a total of __ copies

A

Amazon Aurora is very redundant. You always have #2 copies of your data in each AZ with a minimum of #3 AZ’s. Giving you a total of #6 copies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

You can take ________ with Auroras and share them with other AWS ________

A

You can take snapshots with Auroras and share them with other AWS accounts

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

DynamoDB is spread across #__ geographically ______ data centers to ensure _________

A

DynamoDB is spread across 3 geographically distinct data centers to ensure resiliency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

With DynamoDB you get eventually ________ reads by default

A

With DynamoDB you get eventually consistent reads by default

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the three read options you get with DynamoDB?

A

eventual consistency,
strong consistency,
transactional.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

The acronym ACID refers to the four key properties of a transaction:

A

Atomicity,
Consistency,
Isolation,
Durability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Scenario questions that mention ACID requirements should make you think of _________ transactions

A

Scenario questions that mention ACID requirements should make you think of DynamoDB transactions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What effect does DynamoDB On-Demand Backup have on the performance or availability of your tables?

____ impact on performance while offering ____ backups

A

Zero impact on performance while offering full backups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

In which region is DynamoDB On-Demand Backups retained?

A

Same region as the source table

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

DynamoDB Point-in-Time Recovery protects against accidental ______ or _______

A

DynamoDB Point-in-Time Recovery protects against accidental writes or deletes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

When using DynamoDB Point-in-Time Recovery what is the last restorable point in the past?

A

5 minutes in the past.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Streams are time-ordered sequences of ____ level changes in a ____.

A

Streams are time-ordered sequences of item level changes in a table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Every shard in a stream is stored for how long?

A

24 hours

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

A Database on an EC2 is ideal if you need (2)

____ _______ over instance and database or have a _____-_____ database engine (not avail in RDS)

A

full control over instance and database or have a third-party database engine (not avail in RDS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

Amazon RDS is ideal if you have data that is well-______ and ________

A

Amazon RDS is ideal if you have data that is well-formed and structured

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

DynamoDB 4 main features:

A

NoSQL
High I/O needs
Dynamic Scaling
In-memory performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

Data Warehouse for large volumes of aggregated data

A

Amazon Redshift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
28
Q

Amazon Elasticache is fast _________ storage for _____ amounts of data.

Amazon Elasticache is an in-_______ store.

A

Amazon Elasticache is fast temporary storage for small amounts of data.

Amazon Elasticache is an in-memory store.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
29
Q

Amazon EMR is an ________ workload using the ________ framework

A

Amazon EMR is an Analytics workload using the Hadoop framework

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
30
Q

What are the 6 Database engines that Amazon RDS supports?

_____ Server

______ SQL

___ Server

_____

_____

______

A

MySQL Server

PostgreSQL

SQL Server

Aurora

Oracle

MariaDB

31
Q

Amazon RDS has a ______ maintenance schedule by default but you can choose your own. OS & DB ________ is what happens during the weekly maintenance window

A

Amazon RDS has a weekly maintenance schedule by default but you can choose your own. OS & DB patching is what happens during the weekly maintenance window

32
Q

When is the only time you can enable encryption on an AWS RDS DB instance?

A

When you create it
DBs that has encryption enabled cannot be disable it

33
Q

You cannot have an ________ read replica of an _________ DB instance

A

You cannot have an encrypted read replica of an unencrypted DB instance

34
Q

You cannot restore an unencrypted _______ or _______ to an encrypted __ instance

A

You cannot restore an unencrypted backup or snapshot to an encrypted DB instance

35
Q

Read replicas of encrypted primary instances are ________

A

Read replicas of encrypted primary instances are encrypted

36
Q

Amazon Aurora is up to five times faster than standard _____ databases and three times faster than standard _________ databases

A

Amazon Aurora is up to five times faster than standard MySQL databases and three times faster than standard PostgresSQL databases

37
Q

Aurora Serverless Use Cases (5)

New or ___________ used applications

  • *_____**-tenant applicatons
  • *___________** Workloads
  • *_______** Workloads
  • *___________** and test databases
A

New or Infrequently used applications

  • *Multi**-tenant applicatons
  • *Unpredictable** Workloads
  • *Variable** Workloads
  • *Development** and test databases
38
Q

Amazon ElastiCache is a fully managed caching service for _____ and _________

A

Amazon ElastiCache is a fully managed caching service for Redis and Memcached

39
Q

Amazon ElastiCache is a ____/____ store

A

Amazon ElastiCache is a key/value store

40
Q

Amazon ElastiCache can be put in front of databases such as ____ and _________

A

Amazon ElastiCache can be put in front of databases such as RDS and DynamoDB

41
Q

ElastiCache nodes run on ___ _________ so you must choose an ________ ______ type

A

ElastiCache nodes run on EC2 instances so you must choose an instance family type

42
Q

Between Memcached and Redis, which offers Data Persistence?

A

Between Memcached and Redis, which offers Data Persistence?

Redis offers Data Persistence
Memcached does not

43
Q

Memcached will place nodes in Multi-AZs but you will not get ________ or _________

A

Memcached will place nodes in Multi-AZs but you will not get failover, or replication

44
Q

You can use ElastiCache for caching, which accelerates ___________ and ________ performance

A

You can use ElastiCache for caching, which accelerates application and database performance

45
Q

Amazon ElastiCache can also be a primary data store for use cases that don’t require durability like gaming ________, ________, and _______

A

Amazon ElastiCache can also be a primary data store for use cases that don’t require durability like gaming leaderboards, streaming, and analytics.

46
Q

You can restore your DynamoDB database backup to any point in the last ___ days, the backups are __________. This feature is not _________ by _________.

A

At any point in the last 35 days, the backups are incremental. Not enabled by default.

47
Q

DynamoDB Accelerator is an __-_______ cache that increases __________ (microsecond latency)

A

The acronym DAX represents DynamoDB Accelerator which is an In-memory cache that increases performance (microsecond latency)

48
Q

What type of backups and copies of your data does Redshift offer?

Redshift offers continuous _________ backups
And always keeps ______ copies of your data

A

continuous incremental backups
Always keeps three copies of your data

49
Q

Amazon EMR can be used for ___________ and ______ large amounts of ____

A

Amazon EMR can be used for transforming and moving large amounts of data

50
Q

Kinesis Data Streams enables real-time processing of ________ ___ ____

A

Kinesis Data Streams enables real-time processing of streaming big data

51
Q

The Kinesis Client Library helps you ______ and ______ data from a Kinesis data stream

A

The Kinesis Client Library helps you consume and process data from a Kinesis data stream

52
Q

Kinesis Data Firehose _____, _____ and loads streaming data

A

Kinesis Data Firehose Captures, Transforms, and loads streaming data

53
Q

With Kinesis Data Firehose there are no ____, everything is ______

A

With Kinesis Data Firehose there are no Shards, everything is automated

54
Q

Between Memcached and Redis, which offers encryption?

A

Between Memcached and Redis, which offers encryption?

Redis offers encryption
Memcached does not

55
Q

Kinesis Data Firehose enables ____ real-time _______ with existing business intelligence tools and dashboards

A

Kinesis Data Firehose enables near real-time analytics with existing business intelligence tools and dashboards

56
Q

Kinesis Data Firehose possible destinations:

S3
_____
Data___
HTTP ______
Mongo___

A

S3
Splunk
Datadog
HTTP Endpoint
MongoDB

57
Q

Kinesis Data Analytics provides ____-____ SQL processing for streaming data

A

Kinesis Data Analytics provides real-time SQL processing for streaming data

58
Q

Kinesis Data Analytics destination can be

Kinesis ________
Kinesis _________
Lambda

A

Kinesis Data Streams
Kinesis Data Firehose
Lambda

59
Q

Amazon Athena is used for _______ ____ in S3 using SQL

A

Amazon Athena is used for querying data in S3 using SQL

60
Q

You would connect Amazon Athena to data sources other than S3 by using _____.

A

You would connect Amazon Athena to data sources other than S3 by using Lambda

61
Q

The methods that can be used to optimize Amazon Athena include:

_______ you data
______ your data
________ your data

A
  • *Partition** you data
  • *Bucket** your data

Compress your data

62
Q

AWS Glue is a fully managed extract, _______, and ____ service that is used for preparing data for analytics.

A

AWS Glue is a fully managed extract, transform and load service that is used for preparing data for analytics.

63
Q

AWS Glue discovers data and stores the associated metadata in the AWS ____ ____ Catalog

A

AWS Glue discovers data and stores the associated metadata in the AWS Glue Data Catalog

64
Q

You can use a _______ to populate the AWS Glue Data Catalog with _______

A

You can use a crawler to populate the AWS Glue Data Catalog with tables

65
Q

Between Memcached and Redis, which offers Multithreading?

A

Between Memcached and Redis, which offers Multithreading?

Redis offers Multithreading
Memcached does not

66
Q

A crawler can crawl multiple data stores in a ______ ___

A

A crawler can crawl multiple data stores in a single run

67
Q

A real-time solution to process or move data is called _____.

A

A real-time solution to process or move data is called Kinesis

68
Q

Simple Queue Service (SQS) and Kinesis can both be queues, but each has its pros and cons:

SQS is _____ and _____
Kinesis is _____ & can store data for up to a ____

A

SQS is easier and simpler
Kinesis is faster & can store data for up to a year

69
Q

Anytime Serverless SQL comes up on the test think of _______ _______.

A

Anytime Serverless SQL comes up on the test think of Amazon Athena

70
Q

Quicksight is a service that is used for _______ the data in a dashboard or _____

A

Quicksight is a service that is used for visualizing the data in a dashboard or graph

71
Q

The acronym DAX represents __________ ___________

A

The acronym DAX represents DynamoDB Accelerator

72
Q

An Aurora global database consists of ___ primary AWS Region where your data is mastered and up to ____ read-only, _________ AWS Regions

A

An Aurora global database consists of one primary AWS Region where your data is mastered and up to five read-only, secondary AWS Regions

73
Q

Amazon Athena is an interactive _____ service that makes it easy to _______ data in Amazon __ using standard SQL.

A

Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standard SQL.

74
Q

Athena is serverless, so there is no infrastructure to manage, and you pay only for the ______ that you run.

A

Athena is serverless, so there is no infrastructure to manage, and you pay only for the queries that you run.