Databases Flashcards

1
Q

What are disks vs. databases?

A

Disks: EFS, EBS, EC2 Instance Store, S3

Databases: RDS, Aurora, ElastiCache, DynamoDB, Redshift, EMR, Athena, QuickSight, DocumentDB, Neptune, QLDB, Managed Blockchain, DMS, Glue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a database let you do that a disk can’t?

A

Structure data, build indices to efficiently query/search data, and define relationships b/w your datasets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Benefits of no-SQL databases? What form do they take?

A

Flexible, scalable, high-performance, and high-functional

JSON

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the benefit of using AWS databases?

A

AWS manages them:

  • Quick Provisioning, High Availability, Vertical and Horizontal Scaling
  • Automated Backup & Restore, Operations, Upgrades
  • Operating System Patching is handled by AWS
  • Monitoring, alerting
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are relational databases?

A

RDS & Aurora (SQL)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

In-memory Database

A

ElastiCache

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Key/Value Database? Can it be used serverless? What’s its caching mechanism?

A

DynamoDB (serverless)

DAX (cache for DynamoDB)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data warehouse

A

Redshift (SQL)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Hadoop Cluster

A

EMR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How can you query data on Amazon S3?

What can it handle? (options below)
servers
severless
sql 
no-sql
A

Athena

Serverless and can handle SQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a managed Hyperledger Fabric & Ethereum blockchains?

A

Amazon Managed Blockchain

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Managed ETL (Extract Transform Load) and Data Catalog service

A

Glue

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

DMS & traits?

A

Database migration system - allows intervals, quickly and securely migrates DB to AWS in a resilient and self healing, while allowing source database to remain available during the migration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Graph database

A

Neptune

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

dashboards on your data (serverless)

A

QuickSight

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

“Aurora for MongoDB” (JSON – NoSQL database)

A

DocumentDB

17
Q
What allows you to create databases managed by AWS for the following:
• Postgres
• MySQL
• MariaDB
• Oracle
• Microsoft SQL Server
• Aurora (AWS Proprietary database)
18
Q

What are the pros & cons to RDS vs. DB on EC2?

A

RDS is a managed service:
• Automated provisioning, OS patching
• Continuous backups and restore to specific timestamp (Point in Time Restore)!
• Monitoring dashboards
• Read replicas for improved read performance
• Multi AZ setup for DR (Disaster Recovery)
• Maintenance windows for upgrades
• Scaling capability (vertical and horizontal)
• Storage backed by EBS (gp2 or io1)
• BUT you can’t SSH into your instances

19
Q

What are supported by Aurora DB (which is cloud-optimized)?

A

PostgreSQL and MySQL

20
Q

What are 3 unique features of RDS deployments?

A
  1. Can read replicas to scale the read workload of your DB (1 app can read from 3 Amazon RDS - 1 main/2 replicas)
  2. Can do multi-AZ and failover in case of AZ outage (high availability); read/write to main RDS but rep across AZ to 1 other RDS
  3. Can do multi-region; if disaster in 1 region, can do local performance for global reads, ensure disaster recovery in case of region issues, but charge rep cost
21
Q

What do you use ElastiCache for?

A

To get managed Redis or Memcached; caches are in-memory DB and allows reducing load of DB for read-intensive wkloads; allows quicker read/write from cache

22
Q

What is a noSQL DB that can scale to massive workloads due to its distributed serverless DB with single-digit milisecond latency?

23
Q

What is a fully managed in-memory cache for DynamoDB that improves performance by 10X?

24
Q

What is a postgreSQL based OLAP that has columnar storage of data?

25
What helps create Hadoop clusters (Big Data) to analyze and process vast amount of data, where clusters could be hundreds of EC2 instances; and is used for data processing, machine learning, web indexing, and big data?
Elastic MapReduce
26
What is a fully Serverless database with SQL capabilities?
Athena
27
What are the use cases of Athena?
one-time SQL queries, serverless queries on S3, log analytics
28
What is the relationship between S3 and Athena?
Query data in S3 and get output to S3
29
What is the pricing of Athena?
Pay per query
30
What is a serverless machine learning-powered business intelligence service to create interactive dashboards?
Amazon QuickSight
31
What is AWS version of a NoSQL database for storing, querying, and indexing JSON data?
DocumentDB
32
What is a fully managed graph database (e.g. social network) that is highly available (3 AZ) and up to 15 read replicas?
Amazon Neptune
33
What can be used to review history of all the changes made to your application data over time, and is immutable?
Quantum ledger database
34
How is QLDB different from Amazon Managed Blockchain?
No decentralization component, in accordance with | financial regulation rules
35
What is a serverless AWS svc used to manage extract, transform, and load (ETL), i.e. useful for preparing and transforming data for analytics?
AWS Glue
36
What can Glue Data Catalogue be used by?
Athena, Redshift, EMR (Elastic MadReduce)
37
What svc can be used for quickly and securely migrating databases to AWS, resilient, self healing?
Database Migration Svc (DMS)