3. AWS Databases Flashcards

1
Q

What are the Relational Databases on AWS?

A

The Relational Databases on AWS are:

  • SQL Server
  • Oracle
  • MySQL Server
  • PostgreSQL
  • Aurora
  • MariaDB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the two key features of Relational Databases on AWS?

A

The two key features of AWS Relational Databases are:

  • Multi-AZ for DR
  • Read replicas for performance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is Data Warehousing?

A

Data Warehousing is used for business intelligence.

It is used to pull in very large and complex data sets in order to perform management queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Are Databases best suited to OLTP or OLAP processes?

A

Databases are best suited to OLTP processes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Are Datawarehouses better suited to OLTP or OLAP processes?

A

Datawarehouses are better suited to OLAP processes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Elasticache?

A

Elasticache is a web service to deploy, operate and scale an in-memory cache in the Cloud.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does Elasticache do?

A

Elasticache improves the performance of web applications by allowing you to retrieve information from fast memory caches rather than from slower disk-based databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the name of AWS own Relational Database?

A

RDS is the name of the AWS own Relational Database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Can you access the RDS VMs?

A

No, the RDS VMs are totally inaccessible

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the two types of RDS backup?

A

Automated backups and Database snapshots are the two ways to back-up RDS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do RDS automated backups work?

A

Automated backups take a daily snapshot and also transaction logs throughout the day.

This gives a point in time recovery of the RDS to within a second.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Where are RDS automated backups stored?

A

RDS automated backups are stored in S3

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

When might you expect increased latency with your RDS instance?

A

You may expect increased latency during the backup window as I/O may be suspended.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What are RDS DB Snapshots?

A

RDS DB Snapshots are user initiated and persist even after the original RDS is deleted (unlike automated backups).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What happens when you restore either an Automated backup or DB Snapshot?

A

When you perform a restore, the restored version will be a new RDS instance with a new DNS endpoint

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Multi-AZ?

A

Multi-AZ means that you can have an exact copy of your DB in another AZ.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

For what DBs is Multi-AZ available?

A

Multi-AZ is available for:

  • SQL Server
  • Oracle
  • MySQL Server
  • PostgreSQL
  • MariaDB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is a Read Replica?

A

Read Replicas allow you to have a read-only copy of your database.

These help with performance for read-heavy databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What are Read Replicas used for?

A

Read Replicas are used to help with scaling and performance, not DR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What must be enabled in order to use Read Replicas?

A

Automated Backups must be enabled to use Read Replicas

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Can Read Replicas be promoted to their own database?

A

Read Replicas can be promoted to their own database but this breaks the replication

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Can Read Replicas be across different regions?

A

Yes Read Replicas can be across different regions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

How is ‘encryption at rest’ performed on AWS databases?

A

Encryption is done using the AWS Key Management Service (KMS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is DynamoDB?

A

DynamoDB is a fast and flexible NoSQL database.

Instead of creating tables and rigid schema, populating those tables and using SQL to query and return results, with DynamoDB you create tables with attributes and store data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Q

What are Eventual Consistent Reads?

A

Eventual Consistent Reads is when consistency across all copies of the data is achieved within a second

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
26
Q

What are Strongly Consistent Reads?

A

When you request a strongly consistent read, DynamoDB returns a response with the most up-to-date data, reflecting the updates from all prior write operations that were successful

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
27
Q

What is DynamoDB Accelerator (DAX)?

A

DAX is a fully managed and highly available in-memory cache that can give performance improvements of up to 10x

28
Q

What is the Database Migration Service (DMS)?

A

AWS DMS allows you to migrate existing databases to AWS hosted databases.

The source and destination database types can be the same or different with the migration process taking care of the schema creation and code changes required to run the new database type.

29
Q

What is RedShift?

A

Redshift is a fully managed petabyte scale data warehouse service in the cloud.

Redshift allows you to perform business intelligence operations on historical data to answer business questions.

30
Q

How can Redshift be configured?

A

Redshift can be configured as either Single Node or Multi Node.

Multi-node:
- Leader Node (manages client connections and receives queries)
- Compute Node (store data and perform queries and computations)

31
Q

What is Massively Parallel Processing (MPP)?

A

MPP is the method by which Redshift automatically distributes data and load across all modules.

32
Q

How do backups work in Redshift?

A

Redshift backups:

  • Maximum retention period of 35 days
  • Redshift attempts to maintain at least 3 copies of the data (2 on compute nodes, 1 copy in S3)
  • Ability to replicate snapshots to S3 in another region
33
Q

What is Aurora?

A

Aurora is a MySQL and PostgreSQL compatible RDS engine.

You can also configure up to 15 read replicas of your database, so you can disperse the read workload across different edge locations.

In terms of data security, Aurora continuously backs up your data to Amazon S3 with time codes so you can perform a point-in-time restore.

34
Q

How is data stored in Aurora?

A

Aurora databases are replicated across several physical locations so at any one time you have 6 copies of your database in existence.

35
Q

What is Aurora Serverless?

A

Aurora Serverless is an on-demand, autoscaling version of Aurora

36
Q

Why would you use Aurora Serverless?

A

Use cases:
- Infrequently used applications.
- New applications (when you’re unsure about the instance size you need).
- Variable and unpredictable workloads.
- Development and test databases.
- Multi-tenant applications.

37
Q

What 3 types of Replicas are available with Aurora?

A

The 3 types of available replicas available with Aurora are:
- Aurora Replicas
- MySQL Replicas
- PostgreSQL Replicas

38
Q

What are the two types of Elasticache engines?

A

The two Elasticache engine types are:

  • Memcached
  • Redis
39
Q

What Services have caching capabilities?

A

The following services have caching capabilities:

  • CloudFront
  • API Gateway
  • Elasticache (Memcached and Redis)
  • DynamoDB (DAX)
40
Q

What data formats does Amazon Athena support?

A

Athena supports Apache ORC, Apache Parquet and JSON

41
Q

What does RAID 0 provide?

A

RAID 0 provides performance improvements compared with a single volume as data can be read and written to multiple disks simultaneously.

42
Q

What is the largest size by default you can have for an RDS for SQL Server database?

A

You can create Amazon RDS for SQL Server database instances with up to 16TB of storage.

43
Q

What data transfer charge is incurred when replicating data between Availability Zones for your Amazon RDS MySQL in a Multi-AZ deployment?

A

Data transferred between Availability Zones for replication of Multi-AZ deployments is free.

44
Q

In RDS, when are changes to the backup window implemented?

A

During the next scheduled maintenance window or immediately

45
Q

If I wanted to run a database on an EC2 instance, which storage option would Amazon recommend?

A

Elastic Block Storage (EBS) is recommended block level storage for EC2 instances if you were running a database on an EC2 instance.

46
Q

When creating a single-AZ Amazon RDS instance, can you can select the Availability Zone into which you deploy it?

A

When you create a DB instance, you can choose an Availability Zone or have AWS choose one for you.

47
Q

MySQL installations default to port number ____.

A

The default endpoint port for MySQL installations is 3306.

48
Q

Which DynamoDB features are chargeable, when using a single region?

A

The chargeable features are:
- the provisioning of read and write capacity
- and the storage of data within DynamoDB

49
Q

With new RDS DB instances, are automated backups are enabled by default?

A

Yes automated backups are enabled by default for new RDS DB instances

50
Q

What happens to the I/O operations of a single-AZ RDS instance during a database snapshot or backup?

A

I/O may be briefly suspended while the backup process initializes (typically under a few seconds), and you may experience a brief period of elevated latency.

51
Q

Which SQL-based relational database is suitable for high-performance OLTP (Online Transactional Processing) workloads?

A

Amazon RDS with provisioned IOPS (SSD) storage allows you to implement a SQL-based relational database solution for your high-performance OLTP workloads.

52
Q

What is EMR?

A

EMR (Amazon Elastic MapReduce) is for big data processing.

It is a Cloud big data platform for processing vast amounts of data using tools like Apache Spark, Apache Hive and others

53
Q

What is the central component of EMR?

A

The central component of EMR is the cluster. A cluster is a collection of EC2 instances.

54
Q

Within EMR, what is an EC2 instance in a cluster referred to?

A

An EC2 instance in the cluster is referred to as a node

55
Q

What are the node types in EMR?

A

The node types are:

  • Master Node
  • Core Node
  • Task Node
56
Q

What two types of db instance make up an Aurora cluster?

A

Primary and Replica instances make up an Aurora cluster

57
Q

What is the difference between a Primary and Replica instance in Aurora?

A

A Primary instance supports read and write operations against the cluster volume, Replica instances support only read operations.

58
Q

How many Aurora replicas can each Aurora db cluster have?

A

Each Aurora DB cluster can have up to 15 Aurora Replicas in addition to the primary DB instance.

59
Q

What features are not supported by Aurora serverless?

A

Not supported features:
- Aurora global databases.
- Aurora multi-master clusters.
- Aurora Replicas.
- IAM database authentication.
- Performance Insights.

60
Q

What type of storage is best suited to transactional data and database storage?

A

Block storage is best suited to transactional data and database storage

61
Q

What are two ways to increase the performance of your database?

A

You can improve database performance using Read Replicas and Elasticache

62
Q

What is Redshift used for?

A

Redshift is used for Business Intelligence or Datawarehousing

63
Q

Which type of backups are retained even after the RDS instance has been deleted?

A

Snapshot backups are retained after the RDS instance has been deleted whereas Automated backups are deleted.

64
Q

What happens when you restore either a snapshot or automated RDS backup?

A

The restore is a new instance with a new DNS endpoint

65
Q

When the at rest, how is your data in RDS encrypted?

A

The data at rest in RDS is encrypted using KMS (Amazon Key Management Service)

66
Q

What is a key use case for DynamoDB?

A

DynamoDB database is massively scalable and responds within a millisecond making it ideal for high traffic applications where performance is critical.

All the performance and scaling operations are taken care of by AWS so all you have to worry about is your data.