Section 9: Databases and Analytics Flashcards

1
Q

What language do Relational Databases use?

A

SQL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are NoSQL Databases?

A

Non-relational Databases

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Benefits of NoSQL Databases

A
  • Flexibility: easy to evolve data model
  • Scalability: designed to scale-out by using distributed clusters
  • High-performance: optimized for a specific data model
  • Highly functional: types optimized for the data model
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

AWS RDS is

A

Relational Database Service that allows you to create databases in the cloud that are managed by AWS

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Amazon Aurora is

A

An AWS cloud optimized database service

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Aurora is (more/less) expensive than RDS but (more/less) efficient

A

Aurora is more expensive than RDS but more efficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is Amazon Aurora Servrerless?

A

Automated database instantiation and auto-scaling based on actual usage

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Read Replica RDS Deployment

A

Scale the read workload of your DB
* Can create up to 15 Read Replicas
* Data is only written to the main DB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Multi-AZ RDS Deployment

A
  • Failover in case of AZ outage (high availability)
  • Data is only read/written to the main database
  • Can only have 1 other AZ as failover
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Multi-Region RDS deployment

A
  • Disaster recovery in case of region issue
  • Local performance for global reads
  • Replication cost
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is Amazon ElasticCache?

A

AWS managed in-memory databases with high performance, low latency that helps reduce load off databases for read intensive workloads

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

DynamoDB

A
  • Fully Managed Highly available with replication across 3 AZ
  • NoSQL database - not a relational database
  • Scales to massive workloads, distributed “serverless” database
  • Millions of requests per seconds, trillions of row, 100s of TB of storage
  • Fast and consistent in performance
  • Single-digit millisecond latency – low latency retrieval
  • Integrated with IAM for security, authorization and administration
  • Low cost and auto scaling capabilities
  • Standard & Infrequent Access (IA) Table Class
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

DynamoDB Accelerator - DAX

A

Fully Managed in-memory cache for DynamoDB only

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

DynamoDB – Global Tables

A
  • Make a DynamoDB table accessible with low latency in multiple-regions
  • Active-Active replication (read/write to any AWS Region)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Redshift

A

*OLAP
*Columnar Storage
*Massively Parallel Query Execution (MPP)
*SQL
*Data Warehouse
*BI tools intergration

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Redshift Serverles

A
  • Automatically provisions and scales data warehouse underlying capacity
  • Run analytics workloads without managing data warehouse infrastructure
  • Pay only for what you use (save costs)
  • Use cases: Reporting, dashboarding applications, real-time analytics…
17
Q

Elastic MapReduce (EMR)

A

*helps creating Hadoop clusters (Big Data) to analyze and process
vast amount of data
* Auto-scaling and integrated with Spot instances
* Use cases: data processing, machine learning, web indexing, big
data…

18
Q

Athena

A
  • Serverless query service to analyze data stored in Amazon S3
  • Uses standard SQL language to query the files
  • Exam Tip: analyze data in S3 using serverless SQL, use Athena
19
Q

QuickSight

A
  • Serverless machine learning-powered business intelligence service to
    create interactive dashboards
  • Fast, automatically scalable, embeddable, with per-session pricing
  • Use cases: Business analytics, Building visualizations, Perform ad-hoc analysis, Get business insights using data
20
Q

DocumentDB

A

*The MongoDB of AWS
*NoSQL
*used to store, query, and index JSON data
* Fully Managed, highly available with replication across 3 AZ
*Automatically scales

21
Q

Amazon Neptune

A
  • Fully managed graph database
  • Highly available across 3 AZ, with up to 15 read replicas * Build and run applications working with highly connected
    datasets – optimized for these complex and hard queries
  • Great for knowledge graphs (Wikipedia), fraud detection, recommendation engines, social networking
22
Q

Amazon Timestram

A
  • Fully managed, fast, scalable, serverless time series database
  • Automatically scales up/down to adjust
    capacity
  • Store and analyze trillions of events per day * 1000s times faster & 1/10th the cost of relational databases
  • Built-in time series analytics functions (helps you identify patterns in your data in near real-time)
23
Q

Quantum Ledger Database

A
  • Fully Managed, Serverless, High available, Replication across 3 AZ
  • Used to review history of all the changes made to your application data over time
  • Immutable system: no entry can be removed or modified, cryptographically verifiable
  • Difference with Amazon Managed Blockchain: no decentralization component, in accordance with financial regulation rules
24
Q

Amazon Managed Blockchain

A
  • Amazon Managed Blockchain is a managed service to jJoin public blockchain networks or create your own scalable private network
  • Compatible with the frameworks Hyperledger Fabric & Ethereum
25
Q

AWS Glue

A
  • Managed extract, transform, and load (ETL) service
  • Useful to prepare and transform data for analytics
  • Fully serverless service
  • can be used by Athena, Redshift, EMR
26
Q

Database Migration Service (DMS)

A
  • Quickly and securely migrate databases
    to AWS, resilient, self healing
  • The source database remains available
    during the migration
    *Supports homogenous and heterogenous migrations
27
Q
A