Databases on AWS Flashcards

(43 cards)

1
Q

Relational databases on AW

A
  1. SQL Server
  2. Oracle
  3. MySQL Server
  4. PostgreSQL
  5. Aurora
  6. MariaDB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RDS key feature

A

Multi-AZ for DR

Read Replicas for Performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is data warehouseing?

A

Used for business intelligence

Used to pull in very large and complex data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

OLTP

A

Online Transaction Processing

RDS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

OLAP

A

Online Analytics Processing

Datawarehousing
RedShift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

OLTP vs OLAP

A

OLTP -

OLAP - Complicated queries for inferring info from data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ElastiCache

A

web service that makes it easy to deploy, operate, and scale an in-memory cache in the cloud.

Used to speed up performance of existing databases by caching frequent identical queries

Flavors:
- memcached
- redis
Helps when DBs get overloaded

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

RDS

A

Relational database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Red Shift

A

Amazon’s OLAP / Datawarehousing solution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Amazon’s NoSQL solution

A

DynamoDB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

RDS backup types

A

Automated backups

Database snapshots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Automated Backups of RDS

A
  • allow you to recover your db to any point in time within a retention period.
  • backups take a full daily snapshot and store transaction logs thru the day
  • During a recovery, AWS will choose the most recent snapshot and then apply the relevant transactions
  • Enabled by default
  • backup data stored in S3
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Automated backup retention period

A

1 - 35 days

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Database snapshots

A

done manually

stored even after RDS instance is deleted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Restoring backups

A

restored version will be a new RDS instance w/ a new DNS endpoint

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

DB encryption at rest

A

Done using the KMS service

encrypts backups, read replicas, and snapshots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Multi-AZ

A
  • designed for DR
  • creating a copy of the DB in a different AZ
  • standby DB synced automatically
  • If primary AZ goes down -> update DNS to point to backup in secondary AZ
  • supports all but Aurora
    • Aurora is fault-tolerant on its own
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Read replica

A

Production db asynchronously writing new data to secondary DBs.
If there is too much load on the prod db,
Could also point individual EC2 instances to specific DBs
Used for read-heavy DB workloads
Available for all DBs
Used for scaling
must have automatic backups on
up to 5 read replica copies of any DB
Each have it’s own DNS endpoint
Can have multi-AZ
can be in a separate region from primary db

19
Q

Redshift key management

A

By default, redshift takes care of key management

You can also manage your own keys through HSM or AWS KMS

20
Q

Redshift Mult-AZ

A

Only available in 1 AZ - no multi-az

Can restore snapshots to a new AZ in event of outage

21
Q

Redshift backup retention

A

enabled by default for 1 day retention period

max retention period = 35days

22
Q

Redshift number of data copies

A

Tries to maintain 3 copies:
original
replica on compute nodes
backup in S3

23
Q

Aurora compatible SQL languages

A

MySQL

PostgreSQL

24
Q

Aurora relational vs noSQL

A

Aurora is a relational database

25
Aurora performance vs MySQL and PostgreSQL
5x better than MySQL | 3x better than PostgreSQL
26
Aurora storage scaling
10GB - 64TB
27
Aurora DR
designed to handle loss of up to two copies of data w/out affecting db write availability and up to three copies w/out affected read availability self-healing: data blocks and disks scanned for errors and repaired automatically
28
Aurora replica types
Aurora replicas (15) MySQL read replicas (5) PosgreSQL (1)
29
Aurora read replication
asynchronous in milliseconds In-region (no cross-region) Automated failover
30
Aurora Serverless
on-demand autoscaling | good for infrequent, intermittent, or unpredictable workloads
31
Aurora copies
6 copies; 2 copies stored in each availability zone, w/ a minimum of 3 AZs
32
Aurora Snapshot sharing
can be shared w/ other AWS accounts
33
memcached vs redis
memcached - simple - multithreaded redis - multi-az - advanced data types - backup/restore
34
DMS - acronym
Database migration service
35
DMS - definition
service to make migrating easy for relational databases, data warehouses, nosql, and other types of data stores. You can migrate into the cloud, between on-prem instances, or any combo of the two
36
Caching services
CloudFront API Gateway ElastiCache (Memcached, Redis) DAX
37
EMR - definition
big data platform for processing large amounts of data
38
EMR - acronym
Elastic map reduce
39
EMR cluster
cluster - collection of EC2 instances node - EC2 instance in the cluster node type - node's role within the cluster
40
EMR node types
Master - manages the cluster Core - runs tasks and stores data (in HDFS) Task - runs tasks. does NOT store data
41
EMR backup
Configure an archive of log files from master node to S3 Can only be configured when creating the cluster archive done in 5 min intervals
42
Amazon Athena
query services that makes it easy to analyze data in S3 using SQL commands
43
Athena supported data formats
JSON, Parquet, ORC