Domain 2: Storage Flashcards

1
Q

What are the two main categories of storage systems?

A

Operational and Analytical

Operational: RDS, DynamoDB, ElastiCache, Neptune
Analytical: Redshift, S3

2
Q

Characteristics of Operational Data Stores

A
  • Data stored in a row-based format
  • Smaller compute size
  • Low latency
  • High throughput
  • High concurrency
  • High change velocity
  • Usually a good fit for caching
  • Mission-critical, HA, DR, data protection
3
Q

Characteristics of RDS

A
  • Fast - Choose between two SSD-backed storage options (General Purpose and Provisioned IOPS) for high-performance OLTP workloads
  • Scales Vertically - RDS is bounded by instance and storage sizes
  • Reliable and Durable - Offers Multi-AZ deployments, automated backups, snapshots, and automatic failover
4
Q

Characteristics of DynamoDB

A
  • Fast - Single-digit millisecond latency at any scale
  • Scales Horizontally - Useful for storing unbounded data, providing low cost and high performance regardless of size
  • Reliable and Durable - Data is replicated across three fault-tolerant AZs, with fine-grained access control. Offers global tables for multi-Region replication
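
The horizontal scaling described above relies on spreading items across partitions by hashing the partition key. The snippet below is only a minimal sketch of that idea: the `partition_for` helper, the MD5 hash, the modulo step, and the `user-N` keys are all illustrative assumptions, not DynamoDB's internal algorithm.

```python
import hashlib
from collections import Counter


def partition_for(key: str, num_partitions: int) -> int:
    """Map a partition key to a partition index with a stable hash (illustrative only)."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_partitions


# Hypothetical keys: 10,000 user IDs spread over 4 partitions, then 8.
keys = [f"user-{i}" for i in range(10_000)]
for n in (4, 8):
    counts = Counter(partition_for(k, n) for k in keys)
    print(n, sorted(counts.items()))
```

Because the hash spreads keys roughly evenly, adding partitions lowers the share of data and traffic each one handles, which is the intuition behind performance staying flat as the table grows.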
5
Q

Characteristics of ElastiCache

A
  • Extreme performance - In-memory data store and cache using optimized stack to deliver sub-millisecond response times
  • Reliable and Durable - The Redis engine offers Multi-AZ deployment with automatic failover
6
Q

Characteristics of Neptune

A
  • Fast and Scalable - Store billions of relationships and query them with millisecond latency
  • Scales Vertically - Bounded by instance size and storage
  • Reliable and Durable - Six replicas of your data across three AZs, with full backup and restore
7
Q

Which services can query data from S3 directly?

A

Redshift Spectrum and Athena can query data in S3 at high speed and with low latency, without moving it out of S3

8
Q

Characteristics of Redshift

A
  • Fast - Columnar storage improves I/O efficiency, and queries are parallelized across nodes. Data load speed scales linearly with cluster size. Of the analytical stores, Redshift gives the fastest query results, at the price of higher-cost storage
  • Reliable and Durable - Replicates your data within the data warehouse cluster and continuously backs it up to Amazon S3, which is designed for 11 9s of durability
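
To see why columnar storage improves I/O efficiency, it can help to count how many stored values an aggregate has to touch under each layout. This is a toy sketch under stated assumptions: the 1,000-row `rows` table, its field names, and both helper functions are hypothetical and are not how Redshift stores data on disk.

```python
# Hypothetical table: 1,000 orders with four fields each.
rows = [
    {"order_id": i, "customer": f"c{i % 50}", "region": "eu" if i % 2 else "us", "amount": float(i)}
    for i in range(1000)
]


def sum_amount_row_store(records):
    """Row-based layout: each record is stored together, so every field is read."""
    values_read = 0
    total = 0.0
    for record in records:
        values_read += len(record)  # the whole row is read to reach one field
        total += record["amount"]
    return total, values_read


# Columnar layout: each field is stored contiguously in its own list.
columns = {name: [r[name] for r in rows] for name in rows[0]}


def sum_amount_column_store(cols):
    """Columnar layout: the aggregate reads only the column it needs."""
    amounts = cols["amount"]
    return sum(amounts), len(amounts)


row_total, row_reads = sum_amount_row_store(rows)
col_total, col_reads = sum_amount_column_store(columns)
print(row_total == col_total, row_reads, col_reads)
```

Both layouts return the same total, but the columnar version touches one value per row instead of four. The gap widens as tables get more columns, which is typical of analytical workloads.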
9
Q

Characteristics of S3

A
  • Fast - Provides a fast, reliable platform for querying structured and semi-structured data. Redshift Spectrum and Amazon Athena can query data at high speed and with low latency without moving it out of S3
  • Reliable and Durable - Data is replicated across at least three AZs in the same Region (One Zone storage classes excepted) and is designed for 11 9s of durability. S3 also offers same-Region and cross-Region replication for even greater durability
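
The "11 9s" figure is easier to remember as arithmetic. The sketch below treats 99.999999999% as a per-object, per-year design target and works out the expected loss rate. The function names and the 10 million object count are illustrative assumptions.

```python
from fractions import Fraction

# 99.999999999% durability, i.e. eleven 9s, as an exact fraction.
durability = Fraction(99_999_999_999, 100_000_000_000)
annual_loss_probability = 1 - durability  # per object, per year


def expected_losses_per_year(object_count: int) -> Fraction:
    """Average number of objects lost per year at the design durability."""
    return object_count * annual_loss_probability


def years_per_expected_loss(object_count: int) -> Fraction:
    """Average number of years between single-object losses."""
    return 1 / expected_losses_per_year(object_count)


print(years_per_expected_loss(10_000_000))
```

With 10 million stored objects, this works out to an expected loss of one object every 10,000 years on average.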