Databases Flashcards

1
Q

RDS runs on VMs

T or F

A

T

2
Q

RDS is serverless

T or F

A

F

it is not serverless

3
Q

aurora serverless is serverless

T or F

A

T

4
Q

read replicas are used for scaling, not DR

T or F

A

T

5
Q

must have auto backups turned on in order to deploy a read replica

T or F

A

T

6
Q

You can have up to ___ read replica copies of any DB

A

5 (the long-standing limit; RDS for MySQL, MariaDB, and PostgreSQL now allows up to 15)

7
Q

you can have read replicas of read replicas

T or F

A

T, but watch out for latency

8
Q

read replica facts:

each read replica will have its own DNS end point

you can have read replicas that have multi AZ

you can create read replicas of multi az source databases

read replicas can be promoted to be their own DB. this breaks replication

you can have a read replica in a second region

A

yes

9
Q

2 types of backups for rds

automated backups

database snapshots

A

yes

10
Q

read replica facts

can be multi az

used to increase performance

must have backups turned on

can be in different regions

can be mysql, postgres, mariadb, oracle, aurora

can be promoted to master; this breaks replication

A

yes

11
Q

multi az tips

used for DR

you can force a failover from one az to another by rebooting the instance.

A

yes

12
Q

This DB service is:

stored on ssd storage

spread across 3 geographically distinct data centers

eventually consistent reads (default)

strongly consistent reads

A

what is dynamo DB?

13
Q

consistency across all copies of data is usually reached within a second with this type of read. repeating a read after a short time should return the updated data. (best read performance)

A

eventually consistent reads

14
Q

A ____ consistent read returns a result that reflects all writes that received a successful response prior to the read

A

strongly
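
The two read modes on cards 13 and 14 can be pictured with a toy model. This is only a sketch, not DynamoDB's actual replication protocol: the `ToyTable` class, its in-memory leader and followers, and the `propagate` step are all hypothetical names invented for illustration.

```python
class ToyTable:
    """Toy model: one leader copy plus two followers that lag behind."""

    def __init__(self):
        self.leader = {}
        self.followers = [{}, {}]

    def write(self, key, value):
        # A write is acknowledged once the leader has it;
        # followers catch up later (see propagate).
        self.leader[key] = value

    def propagate(self):
        # Stands in for replication finishing, usually within about a second.
        for follower in self.followers:
            follower.update(self.leader)

    def read(self, key, consistent=False):
        # Strongly consistent: read the leader, which has every acknowledged write.
        # Eventually consistent: read a follower, which may still be stale.
        source = self.leader if consistent else self.followers[0]
        return source.get(key)


table = ToyTable()
table.write("user#1", "v1")
stale = table.read("user#1")                   # follower has not caught up yet
fresh = table.read("user#1", consistent=True)  # leader always has the write
table.propagate()
later = table.read("user#1")                   # repeating the read returns the update
```

Here the first eventually consistent read misses the write, the strongly consistent read sees it, and repeating the read after replication returns the updated value.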

15
Q

This is a fully managed, highly available, in memory cache for dynamo DB

10x performance improvement

reduces request time from milliseconds to microseconds - even under load

no need for developers to manage cache

compatible with dynamo db api calls

A

dynamo db accelerator (DAX)

16
Q

dynamo db transactions notes:

multiple all or nothing operations

financial transactions

fulfilling orders

two underlying reads or writes - prepare/commit

up to 100 items or 4 mb of data (the limit was originally 25 items)

A

yes
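
The "all or nothing" and "prepare/commit" points can be sketched in a few lines. This is a toy model over a plain dict, not the DynamoDB API; `transact_write` and the `(key, new_value, expected_old_value)` tuple shape are assumptions made up for the example.

```python
def transact_write(table, puts):
    """Apply every put or none of them.

    `puts` is a list of (key, new_value, expected_old_value) tuples.
    """
    # Prepare: verify every condition before touching the table.
    for key, _new, expected in puts:
        if table.get(key) != expected:
            return False  # one failed check cancels the whole transaction
    # Commit: all checks passed, so apply every write.
    for key, new, _expected in puts:
        table[key] = new
    return True


accounts = {"alice": 100, "bob": 50}

# Move 30 from alice to bob: both writes succeed together.
ok = transact_write(accounts, [("alice", 70, 100), ("bob", 80, 50)])

# A second transfer based on a stale balance for alice is rejected as a whole,
# so bob's balance is left untouched as well.
rejected = transact_write(accounts, [("alice", 40, 100), ("bob", 110, 80)])
```

This is why transactions suit financial transfers and order fulfilment: a partial update never becomes visible.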

17
Q

this type of dynamo db capacity provides:

pay per request pricing

balance cost and performance

no minimum capacity

no charge for provisioned read/write capacity - you pay per request, plus storage and backups

pay more per request than with provisioned capacity

new product launches

A

on-demand capacity

18
Q

dynamo db on demand backup and restore notes:

full backups at any time

zero impact on table performance or availability

consistent within seconds and retained until deleted

operates within the same region as the source table

A

yes

19
Q

dynamo db point in time recovery notes:

protects against accidental ______ or deletes

restore to any point in the last ____ days

_____ backups

not enabled by default

latest restorable: ____ minutes in the past

A

writes

35

incremental

five
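
The restore window from this card is simple date arithmetic. A minimal sketch, assuming the figures on the card (35 days back, latest restorable point five minutes in the past); the helper names `restore_window` and `can_restore_to` are hypothetical, not part of any AWS SDK.

```python
from datetime import datetime, timedelta, timezone

RETENTION = timedelta(days=35)     # "restore to any point in the last 35 days"
LATEST_LAG = timedelta(minutes=5)  # "latest restorable: five minutes in the past"


def restore_window(now):
    """Return (earliest, latest) restorable timestamps for a given 'now'."""
    return now - RETENTION, now - LATEST_LAG


def can_restore_to(target, now):
    """True if `target` falls inside the restore window."""
    earliest, latest = restore_window(now)
    return earliest <= target <= latest


now = datetime(2024, 3, 1, 12, 0, tzinfo=timezone.utc)
earliest, latest = restore_window(now)
```

With this `now`, an hour ago is restorable, one minute ago is too recent, and 36 days ago has aged out of the window.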

20
Q

dynamo db ___ are a time ordered sequence of item level changes in a table

they are stored for 24 hours

inserts, updates, and deletes

combine with lambda functions for functionality like stored procedures

A

streams
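
To make "time ordered" and "stored for 24 hours" concrete, here is a toy filter over change records. It is a sketch only: the record dicts and the `readable_records` helper are hypothetical, although INSERT, MODIFY, and REMOVE are the event names DynamoDB Streams uses for inserts, updates, and deletes.

```python
from datetime import datetime, timedelta, timezone

STREAM_RETENTION = timedelta(hours=24)  # figure from the card


def readable_records(records, now):
    """Keep records still inside the 24 hour window, oldest first."""
    cutoff = now - STREAM_RETENTION
    recent = [r for r in records if r["at"] >= cutoff]
    return sorted(recent, key=lambda r: r["at"])


now = datetime(2024, 3, 1, 12, 0, tzinfo=timezone.utc)
records = [
    {"event": "MODIFY", "key": "order#2", "at": now - timedelta(hours=1)},
    {"event": "INSERT", "key": "order#1", "at": now - timedelta(hours=30)},
    {"event": "REMOVE", "key": "order#3", "at": now - timedelta(hours=5)},
]
visible = readable_records(records, now)
```

The 30 hour old INSERT has expired, and the remaining two records come back in the order the changes happened.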

21
Q

dynamo db global tables notes

managed multi master, multi region replication

globally distributed apps

based on dynamo db streams

multi region redundancy for dr or ha

no app rewrites

replication latency under one second

A

yes

22
Q

DMS =

A

database migration service

23
Q

dynamo db security

encryption at rest using ___

site to site ___

direct ____

IAM policies and ____

___ grained access

CW and CT

VPC endpoints

A

KMS

vpn

connect

roles

fine

24
Q

____ is a fast and powerful, fully managed, petabyte scale data warehouse service in the cloud. Customers can start small for just $0.25 per hour with no commitments or upfront costs and scale to a petabyte or more for $1,000 per TB per year, less than a tenth of most other data warehousing solutions

A

redshift

25
_____ transaction example: net profit for EMEA and pacific for the digital radio product. pulls in a large number of records: sum of radios sold in EMEA; sum of radios sold in pacific; unit cost of radio in each region; sales price of each radio; sales price - unit cost
OLAP
26
OLAP =
online analytical processing
27
redshift can be configured as follows: single node (160GB); multi node, with a leader node (manages client connections and receives queries) and compute nodes (store data and perform queries and computations), up to 128 compute nodes
yes
28
redshift advanced ____: columnar data stores can be compressed much more than row based data stores because similar data is stored sequentially on disk. redshift employs multiple compression techniques and can often achieve significant compression relative to traditional relational data stores. in addition, redshift doesn't require indexes or materialized views, and so uses less space than traditional relational database systems. when loading data into an empty table, redshift automatically samples your data and selects the most appropriate compression scheme.
compression
29
MPP =
massively parallel processing
30
___ ___ ___: redshift automatically distributes data and query loads across all nodes. redshift makes it easy to add nodes to your data warehouse and enables you to maintain fast query performance as your data warehouse grows.
massively parallel processing
31
redshift backups: enabled by default with a 1 day retention period; max retention period is 35 days. redshift always attempts to maintain at least three copies of your data (the original and a replica on the compute nodes, and a backup in s3). redshift can also asynchronously replicate your snapshots to s3 in another region for disaster recovery.
yes
32
redshift pricing: compute node hours (the total number of hours you run across all your compute nodes for the billing period; you are billed for 1 unit per node per hour, so a 3 node data warehouse cluster running persistently for an entire month would incur 2,160 instance hours; you are not charged for leader node hours, only compute nodes incur charges); charged for backups; charged for data transfer (only within vpc, not outside it)
yes
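The 2,160 figure on this card is nodes times hours times days. A minimal sketch of that arithmetic, assuming a 30 day month; `compute_node_hours` is a hypothetical helper, not an AWS API, and no actual prices are modelled.

```python
def compute_node_hours(compute_nodes, hours_per_day=24, days=30):
    """Billable node hours: one unit per compute node per hour.

    The leader node is not billed, so it is left out of the count.
    """
    return compute_nodes * hours_per_day * days


hours = compute_node_hours(3)  # 3 nodes * 24 hours * 30 days = 2160
```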
33
redshift security considerations: encrypted in transit using SSL; encrypted at rest using AES-256 encryption; by default redshift takes care of key management, or you can manage your own keys through HSM or AWS key management service
yes
34
redshift availability: ___ AZ(s); can restore snapshots to new AZs in the event of an outage
1
35
What is aurora? it is a mysql and postgresql compatible _____ db engine that combines the speed and availability of high end commercial databases with the simplicity and cost effectiveness of open source databases.
relational
36
aurora provides up to ___x better performance than mysql and ___x better than postgres dbs at a much lower price point, whilst delivering similar performance and availability
5, 3
37
things to know about aurora: 1. start with ___gb, scales in ___gb increments to ___tb (storage autoscaling) 2. compute resources can scale up to ___ vCPUs and 244GB of RAM 3. ___ copies of your data are contained in each AZ, with max of ___ AZs. ___ copies of your data.
1. 10, 10, 64; 2. 32; 3. 2, 3, 6
38
aurora is designed to transparently handle the loss of up to ___ copies of data without affecting db write availability and up to ___ copies without affecting read availability
2,3
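Cards 37 and 38 fit together: 2 copies in each of 3 AZs gives 6 copies, and the tolerances of 2 and 3 lost copies decide what survives a failure. The sketch below just encodes those card figures; the constant names and the `availability` helper are hypothetical, and it says nothing about how aurora implements this internally.

```python
COPIES_PER_AZ = 2
AZ_COUNT = 3
TOTAL_COPIES = COPIES_PER_AZ * AZ_COUNT  # 6 copies of the data

MAX_LOST_FOR_WRITES = 2  # figures from the card above
MAX_LOST_FOR_READS = 3


def availability(lost_copies):
    """Return (writes_ok, reads_ok) for a given number of lost copies."""
    if not 0 <= lost_copies <= TOTAL_COPIES:
        raise ValueError("lost_copies must be between 0 and 6")
    return lost_copies <= MAX_LOST_FOR_WRITES, lost_copies <= MAX_LOST_FOR_READS


one_az_down = availability(COPIES_PER_AZ)      # losing a whole AZ
az_plus_one = availability(COPIES_PER_AZ + 1)  # an AZ plus one more copy
```

Losing a whole AZ (2 copies) leaves both reads and writes available; losing one more copy stops writes but reads still work.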
39
t or f aurora storage is self healing. data blocks and disks are continuously scanned for errors and repaired automatically.
t
40
three types of aurora replicas are available: aurora replicas (how many?), mysql read replicas (how many?), postgresql read replicas (how many?)
15, 5, 1
41
t or f backups are always enabled on aurora db instances
t
42
t or f backups impact db performance and must be done during slow traffic periods
f, they do not impact database performance
43
t or f aurora snapshots impact performance
f they do not impact performance
44
t or f aurora snapshots cannot be shared with other aws accounts
f they can
45
aurora ____ is an on demand autoscaling capable edition of aurora. an aurora ___ db cluster automatically starts up, shuts down, and scales capacity up or down based on your apps needs.
serverless
46
t or f aurora serverless provides a relatively simple, cost effective option for infrequent, intermittent, or unpredictable workloads
t
47
does memcached support simple cache to offload DB
yes
48
does memcached support ability to scale horizontally
yes
50
does memcached support multithreaded performance
yes
50
does memcached support advanced data types
no
51
does memcached support ranking/sorting data sets
no
52
does memcached support pub/sub capabilities
no
53
does memcached support persistence
no
54
does memcached support multi AZ
no
55
does memcached support backup and restore capabilities?
no
56
does redis support simple cache to offload DB
yes
57
does redis support ability to scale horizontally
yes
58
does redis support multi threaded performance
no
59
does redis support advanced data types
yes
60
does redis support ranking/sorting data sets
yes
62
does redis support pub/sub capabilities
yes
63
does redis support persistence?
yes
64
does redis support multi az?
yes
65
does redis support backup and restore capabilities?
yes
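The memcached and redis cards above amount to a feature matrix. As a study aid, the sketch below copies the yes/no answers from the cards into a table and picks the engines that cover a set of requirements; `FEATURES` and `engines_for` are hypothetical names, and the values are only as current as the cards.

```python
FEATURES = {
    # feature: (memcached, redis), copied from the answers on the cards
    "simple cache": (True, True),
    "horizontal scaling": (True, True),
    "multithreaded": (True, False),
    "advanced data types": (False, True),
    "ranking/sorting": (False, True),
    "pub/sub": (False, True),
    "persistence": (False, True),
    "multi AZ": (False, True),
    "backup and restore": (False, True),
}


def engines_for(required):
    """Return the engines that support every feature in `required`."""
    engines = []
    for index, name in enumerate(("memcached", "redis")):
        if all(FEATURES[feature][index] for feature in required):
            engines.append(name)
    return engines


needs_durability = engines_for(["persistence", "pub/sub"])  # only redis qualifies
```

Asking for both multithreading and persistence returns an empty list, since per the cards neither engine offers both.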
66
use ___ to increase DB and web application performance
elasticache
67
___ ___ ___ is a cloud service that makes it easy to migrate relational databases, data warehouses, nosql dbs, and other types of data stores. you can use ___ ___ ___ to migrate your data into the cloud, between on prem instances, or between combinations of cloud and on prem setups.
database migration service (DMS)
68
SCT = ?
schema conversion tool
69
t or f you need SCT even if you are migrating to identical databases
f you do not need sct if dbs are the same.
70
DMS - the source can either be on prem, inside aws itself, or another provider such as azure. t or f
t
71
t or f dms allows you to migrate databases from one source to aws.
t
72
t or f you can do homogeneous migrations (same db engines) or heterogeneous migrations (different db engines) with DMS
t
73
t or f if you do a heterogeneous migration with dms, you will need the aws schema conversion tool
t
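The SCT rule from cards 69 to 73 boils down to one comparison. A minimal sketch, assuming engines are identified by plain name strings; `needs_sct` is a hypothetical helper, and compatible pairs (for example mysql to aurora mysql) would need extra handling that is not modelled here.

```python
def needs_sct(source_engine, target_engine):
    """True for a heterogeneous migration (different db engines)."""
    return source_engine.strip().lower() != target_engine.strip().lower()


same_engine = needs_sct("mysql", "mysql")         # homogeneous: no SCT needed
cross_engine = needs_sct("oracle", "postgresql")  # heterogeneous: SCT needed
```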
74
the following services have caching capabilities: api gateway, cloudfront, elasticache (memcached and redis), dynamodb accelerator (DAX)
yes
75
emr = ?
elastic map reduce
76
____ is the industry leading cloud big data platform for processing vast amounts of data using open source tools such as apache spark, apache hive, hbase, flink, hudi, presto. with ____ you can run petabyte scale analysis at less than half the cost of traditional on prem solutions and over 3x faster than standard apache spark
emr
77
the central component of EMR is the ______
cluster
78
EMR match the nodes (master, core, task): 1. a node with software components that only runs tasks and does not store data in HDFS; these are optional. 2. a node that manages the cluster; it tracks the status of tasks and monitors the health of the cluster; every cluster has one. 3. a node with software components that runs tasks and stores data in the hadoop distributed file system (HDFS) on your cluster; multinode clusters have at least one.
1 = task 2 = master 3 = core
79
emr archives log files to s3 at ___ minute intervals
5
80
emr log files are available even after the cluster terminates? t or f
t
81
emr - by default log data is stored on core node. t or f
f, log data is stored on the master node
82
t or f EMR you can configure replication to s3 on 5 min intervals for all log data from the master node, however, this can only be configured when creating the cluster for the first time.
t
83
mysql default port is ___
3306
84
When you add a rule to an RDS DB security group, you must specify a port number or protocol.
False. You do not need to specify a port number or protocol; the RDS instance's port number is automatically applied to the RDS DB security group.
85
If you are using Amazon RDS Provisioned IOPS storage with a Microsoft SQL Server database engine, what is the maximum size RDS volume you can have by default?
16tb
86
What happens to the I/O operations of a single-AZ RDS instance during a database snapshot or backup?
I/O may be briefly suspended while the backup process initializes (typically under a few seconds), and you may experience a brief period of elevated latency.
87
In RDS, what is the maximum value I can set for my backup retention period?
35 days