Databases on AWS Flashcards

1
Q

Relational databases on AW

A
  1. SQL Server
  2. Oracle
  3. MySQL Server
  4. PostgreSQL
  5. Aurora
  6. MariaDB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RDS key feature

A

Multi-AZ for DR

Read Replicas for Performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is data warehouseing?

A

Used for business intelligence

Used to pull in very large and complex data sets

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

OLTP

A

Online Transaction Processing

RDS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

OLAP

A

Online Analytics Processing

Datawarehousing
RedShift

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

OLTP vs OLAP

A

OLTP -

OLAP - Complicated queries for inferring info from data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ElastiCache

A

web service that makes it easy to deploy, operate, and scale an in-memory cache in the cloud.

Used to speed up performance of existing databases by caching frequent identical queries

Flavors:
- memcached
- redis
Helps when DBs get overloaded

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

RDS

A

Relational database

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Red Shift

A

Amazon’s OLAP / Datawarehousing solution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Amazon’s NoSQL solution

A

DynamoDB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

RDS backup types

A

Automated backups

Database snapshots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Automated Backups of RDS

A
  • allow you to recover your db to any point in time within a retention period.
  • backups take a full daily snapshot and store transaction logs thru the day
  • During a recovery, AWS will choose the most recent snapshot and then apply the relevant transactions
  • Enabled by default
  • backup data stored in S3
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Automated backup retention period

A

1 - 35 days

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Database snapshots

A

done manually

stored even after RDS instance is deleted

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Restoring backups

A

restored version will be a new RDS instance w/ a new DNS endpoint

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

DB encryption at rest

A

Done using the KMS service

encrypts backups, read replicas, and snapshots

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Multi-AZ

A
  • designed for DR
  • creating a copy of the DB in a different AZ
  • standby DB synced automatically
  • If primary AZ goes down -> update DNS to point to backup in secondary AZ
  • supports all but Aurora
    • Aurora is fault-tolerant on its own
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Read replica

A

Production db asynchronously writing new data to secondary DBs.
If there is too much load on the prod db,
Could also point individual EC2 instances to specific DBs
Used for read-heavy DB workloads
Available for all DBs
Used for scaling
must have automatic backups on
up to 5 read replica copies of any DB
Each have it’s own DNS endpoint
Can have multi-AZ
can be in a separate region from primary db

19
Q

Redshift key management

A

By default, redshift takes care of key management

You can also manage your own keys through HSM or AWS KMS

20
Q

Redshift Mult-AZ

A

Only available in 1 AZ - no multi-az

Can restore snapshots to a new AZ in event of outage

21
Q

Redshift backup retention

A

enabled by default for 1 day retention period

max retention period = 35days

22
Q

Redshift number of data copies

A

Tries to maintain 3 copies:
original
replica on compute nodes
backup in S3

23
Q

Aurora compatible SQL languages

A

MySQL

PostgreSQL

24
Q

Aurora relational vs noSQL

A

Aurora is a relational database

25
Q

Aurora performance vs MySQL and PostgreSQL

A

5x better than MySQL

3x better than PostgreSQL

26
Q

Aurora storage scaling

A

10GB - 64TB

27
Q

Aurora DR

A

designed to handle loss of up to two copies of data w/out affecting db write availability
and up to three copies w/out affected read availability

self-healing: data blocks and disks scanned for errors and repaired automatically

28
Q

Aurora replica types

A

Aurora replicas (15)
MySQL read replicas (5)
PosgreSQL (1)

29
Q

Aurora read replication

A

asynchronous in milliseconds
In-region (no cross-region)
Automated failover

30
Q

Aurora Serverless

A

on-demand autoscaling

good for infrequent, intermittent, or unpredictable workloads

31
Q

Aurora copies

A

6 copies; 2 copies stored in each availability zone, w/ a minimum of 3 AZs

32
Q

Aurora Snapshot sharing

A

can be shared w/ other AWS accounts

33
Q

memcached vs redis

A

memcached

  • simple
  • multithreaded

redis

  • multi-az
  • advanced data types
  • backup/restore
34
Q

DMS - acronym

A

Database migration service

35
Q

DMS - definition

A

service to make migrating easy for relational databases, data warehouses, nosql, and other types of data stores.
You can migrate into the cloud, between on-prem instances, or any combo of the two

36
Q

Caching services

A

CloudFront
API Gateway
ElastiCache (Memcached, Redis)
DAX

37
Q

EMR - definition

A

big data platform for processing large amounts of data

38
Q

EMR - acronym

A

Elastic map reduce

39
Q

EMR cluster

A

cluster - collection of EC2 instances
node - EC2 instance in the cluster
node type - node’s role within the cluster

40
Q

EMR node types

A

Master - manages the cluster
Core - runs tasks and stores data (in HDFS)
Task - runs tasks. does NOT store data

41
Q

EMR backup

A

Configure an archive of log files from master node to S3
Can only be configured when creating the cluster
archive done in 5 min intervals

42
Q

Amazon Athena

A

query services that makes it easy to analyze data in S3 using SQL commands

43
Q

Athena supported data formats

A

JSON, Parquet, ORC