Databases and Analytics Flashcards

1
Q

Amazon Relational Database Service (RDS)

A
  • RDS uses EC2 instances, so you must choose an instance
    family/type
  • Relational databases are known as Structured Query Language
    (SQL) databases
  • RDS is an Online Transaction Processing (OLTP) type of database
  • Easy to setup, highly available, fault tolerant, and scalable
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

RDS Encryption

A
  • Can encrypt your Amazon RDS instances and snapshots at rest
  • Encryption uses AWS Key Management Service (KMS)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

RDS DB support types?

A

SQL Server, Oracle, MySQL Server, PostgreSQL, Aurora,
MariaDB

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

RDS scaling measures and DR?

A
  • Scales up by increasing instance size (compute and storage)
  • Read replicas option for read heavy workloads (scales out for
    reads/queries only)
  • Disaster recovery with Multi-AZ option
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Amazon Aurora

A
  • Amazon Aurora is an AWS database offering in the RDS
    family
  • Amazon Aurora is a MySQL and PostgreSQLcompatible relational database built for the cloud
  • Amazon Aurora features a distributed, fault-tolerant, self healing storage system that auto-scales up to 128TB per database instance
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Amazon DynamoDB

A
  • Fully managed NoSQL database service
  • Key/value store and document store
  • It is a non-relational, key-value type of database
  • Fully serverless service
  • Push button scaling
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Amazon DynamoDB features and benefits

A

Serverless - Fully managed, fault tolerant, service
Highly available - 99.99% availability SLA – 99.999% for Global Tables
NoSQL type of database with Name / Value
structure - Flexible schema, good for when data is not well structured or unpredictable
Horizontal scaling - Seamless scalability to any scale with push button scaling or Auto Scaling
DynamoDB Accelerator (DAX) - Fully managed in-memory cache for DynamoDB that increases performance (microsecond latency)
Backup - Point-in-time recovery down to the second in last 35 days; On-demand backup and restore
Global Tables - Fully managed multi-region, multi-master solution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Amazon RedShift

A
  • RedShift is a SQL based data warehouse used for analytics
    applications
  • RedShift is a relational database that is used for Online
    Analytics Processing (OLAP) use cases
  • RedShift uses Amazon EC2 instances, so you must choose an
    instance family/type
  • RedShift always keeps three copies of your data
  • RedShift provides continuous/incremental backups
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Amazon EMR

A
  • Managed cluster platform that simplifies running big data
    frameworks including Apache Hadoop and Apache Spark
  • Used for processing data for analytics and business
    intelligence
  • Can also be used for transforming and moving large amounts
    of data
  • Performs extract, transform, and load (ETL) functions
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Amazon ElastiCache

A
  • Fully managed implementations Redis and Memcached
  • ElastiCache is a key/value store
  • In-memory database offering high performance and low
    latency
  • Can be put in front of databases such as RDS and DynamoDB
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Amazon Athena

A
  • Athena queries data in S3 using SQL
  • Can be connected to other data sources with Lambda
  • Data can be in CSV, TSV, JSON, Parquet and ORC formats
  • Uses a managed Data Catalog (AWS Glue) to store
    information and schemas about the databases and tables
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

AWS Glue

A
  • Fully managed extract, transform and load (ETL) service
  • Used for preparing data for analytics
  • AWS Glue runs the ETL jobs on a fully managed, scale-out
    Apache Spark environment
  • Works with data lakes (e.g. data on S3), data warehouses
    (including RedShift), and data stores (including RDS or EC2
    databases)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Amazon Kinesis Data Streams

A
  • Producers send data which is stored in shards for up to 7
    days
  • Consumers process the data and save to another service
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Amazon Kinesis Data Firehose

A
  • No shards, completely automated and elastically scalable
  • Saves data directly to another service such as S3, Splunk,
    RedShift, or Elasticsearch
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Amazon Kinesis Data Analytics

A
  • Provides real-time SQL processing for streaming data
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

AWS Data Pipeline

A

AWS Data Pipeline
* Processes and moves data between different AWS compute and
storage services
* Save results to services including S3, RDS, DynamoDB, and EMR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Amazon QuickSight

A

Amazon QuickSight
* Business intelligence (BI) service
* Create and publish interactive BI dashboards for Machine
Learning-powered insights

18
Q

Amazon Neptune

A

Amazon Neptune
* Fully managed graph database service

19
Q

Amazon DocumentDB

A

Amazon DocumentDB
* Fully managed document database service (non-relational)
* Supports MongoDB workloads
* Queries and indexes JSON data

20
Q

Amazon QLDB

A
  • Fully managed ledger database for immutable change history
  • Provides cryptographically verifiable transaction logging
21
Q

Amazon Managed Blockchain

A
  • Fully managed service for joining public and private networks
    using Hyperledger Fabric and Ethereum
22
Q

AWS Migration Hub

A
  • Provides a single location to track the progress of application
    migrations across multiple AWS and partner solutions
23
Q

AWS Database Migration Service (DMS)

A
  • AWS Database Migration Service helps you migrate
    databases to AWS quickly and securely.
  • The source database remains fully operational during the
    migration, minimizing downtime to applications that rely on
    the database
24
Q

AWS Server Migration Service (SMS)

A
  • Migrates servers and virtual machines to Amazon EC2
  • Agentless service which makes it easier and faster for you to
    migrate thousands of on-premises workloads to AWS
  • Automate, schedule, and track incremental replications of
    live server volumes
25
AWS DataSync
* Online data transfer service * Transfer data between on-premises and AWS storage services
26
Snowball Family
* AWS Snowball and Snowmobile are used for migrating large volumes of data to AWS Uses a secure storage device for physical transportation * Snowball (80TB) (50TB ) “petabyte scale” * Snowball Edge (100TB) “petabyte scale” * Snowmobile – “exabyte scale” with up to 100PB per Snowmobile Types Snowball Edge Compute Optimized Snowball Edge Storage Optimized Snowcone
27
Snowball Edge Compute Optimized
* Provides block and object storage and optional GPU * Edge computing use cases
28
Snowball Edge Storage Optimized
* Provides block storage and Amazon S3-compatible object storage * Use for local storage and large-scale data transfer
29
Snowcone
* Small device used for edge computing, storage and data transfer * Can transfer data offline or online with AWS DataSync agent
30
AWS Rekognition
* Add image and video analysis to your applications * Identify objects, people, text, scenes, and activities in images and videos
31
Amazon Transcribe
* Add speech to text capabilities to applications * Recorded speech can be converted to text before it can be used in applications
32
Amazon Translate
* Neural machine translation service that delivers fast, highquality, and affordable language translation * Localize content such as websites and applications for your diverse users
33
Amazon Sagemaket
* Helps data scientists and developers to prepare, build, train, and deploy high-quality machine learning (ML) models
34
Amazon Comprehend
* Natural-language processing (NLP) service * Uses machine learning to uncover information in unstructured data
35
Amazon Lex
* Conversational AI for Chatbots * Build conversational interfaces into any application using voice and text
36
Amazon Polly
* Turns text into lifelike speech * Create applications that talk, and build entirely new categories of speech-enabled products
37
Amazon Workspaces
* Managed Desktop-as-a-Service (DaaS) solution * Provision either Windows or Linux desktops
38
AWS AppStream 2.0
* Fully managed non-persistent application streaming service * Alternative to popular products such as Citrix XenApp
39
AWS Worklink
* Provides secure, one-click access to your internal websites and web apps using mobile phone browsers * Does not require VPN client or App
40
AWS WorkDocs
* Fully managed, secure content creation, storage, and collaboration service * Create, edit, and share content that’s centrally stored on AWS
41
AWS IoT Core
* Lets you connect IoT devices to the AWS cloud without the need to provision or manage servers