Chapter 5 Flashcards

1
Q

What would you use for tabular data with a well-defined schema?

A

Amazon Relational Database Service (RDS)

2
Q

What would you use for heavy analytics and reporting workloads?

A

A data warehouse such as Amazon Redshift

3
Q

What is the difference between RDS and Redshift?

A

RDS stores data in row-based storage, whereas Redshift uses columnar (column-based) storage.
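
To make the row versus column distinction concrete, here is a minimal in-memory sketch. It does not use RDS or Redshift; the sample records and the helper names `get_order` and `total_amount` are hypothetical and only illustrate why each layout suits a different access pattern.

```python
# Hypothetical sample data, not taken from any AWS service.
rows = [
    {"order_id": 1, "region": "EU", "amount": 20.0},
    {"order_id": 2, "region": "US", "amount": 35.5},
    {"order_id": 3, "region": "EU", "amount": 12.5},
]


def get_order(rows, order_id):
    """Row-based layout: every field of a record sits together,
    so returning one complete record is a single lookup."""
    for row in rows:
        if row["order_id"] == order_id:
            return row
    return None


# Column-based layout: all values of one column sit together.
columns = {name: [row[name] for row in rows] for name in rows[0]}


def total_amount(columns):
    """Column-based layout: an aggregate reads only the one column it needs
    and never touches the other fields."""
    return sum(columns["amount"])


print(get_order(rows, 2))
print(total_amount(columns))
```

Transactional lookups of whole records favor the row layout, while analytic aggregates over a few columns of many records favor the column layout.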

4
Q

What would you use for very large structured datasets?

A

Amazon Redshift

5
Q

If your data is semi-structured, what repository should you consider?

A

DynamoDB

6
Q

How does DynamoDB store data?

A

As key-value pairs
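
As a rough illustration of the key-value model, the sketch below uses a plain Python dictionary as a stand-in for a table. It is not the DynamoDB API; `put_item` and `get_item` here are hypothetical local helpers, and the keys and attributes are made up. The point is that each item is found by its key and items need not share the same attributes.

```python
# In-memory stand-in for a key-value table (an assumption for illustration).
table = {}


def put_item(table, key, item):
    """Store a copy of the item under its key, replacing any existing item."""
    table[key] = dict(item)


def get_item(table, key):
    """Return the item stored under the key, or None if there is none."""
    return table.get(key)


# Two items with different attribute sets: no fixed schema is enforced.
put_item(table, "user#1", {"name": "Ana", "email": "ana@example.com"})
put_item(table, "user#2", {"name": "Bo", "tags": ["admin"]})

print(get_item(table, "user#2"))
```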

7
Q

What can you use to store data that doesn’t really have a schema?

A

DynamoDB

8
Q

If your data currently lives in an open-source NoSQL store like MongoDB, how can you migrate it easily to AWS?

A

Amazon DocumentDB

9
Q

What would you use to centrally manage and govern data access across multiple repositories?

A

AWS Lake Formation

10
Q

What tools could be used to run analytics or ETL workstreams on data in the data lake?

A

Amazon Redshift or Amazon EMR
