AWS Certified Solutions Architect - Associate Flashcards

1
Q

SHARDS

A

Think Kinesis Streams. It is the only form of Kinesis that uses shards.
Default is storage in shards for 24hrs. Can be configured up to days

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Kinesis Firehose

A

NO PERSISTENT STORAGE (like streams>shards). Data has to be analysed as it comes in.
Can use lambda functions. Data comes in, triggers lambda then outputs to s3 or Redshift (although not directly to Redshift. Needs to go to s3 first then import in to Redshift) or Elasticsearch cluster and Splunk

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

SQS - FIFO Queue

A

First-in First-Out Delivery
Exactly Once Processing
Limited to 300 transactions per second

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

SQS

A

Is PULL based (not pushed = SNS)
Messages are 256KB in size
Messages can be kept in the queue from 1 minute to 14 days
Default retention is 4 days Important to remember

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

SQS - Visibility Time Out

A

The amount of time that a msg is invisible in the Q after a reader picks up that msg.
DEFAULT 30 seconds
MAXIMUM is 12HRS
Provided the job is processed before the visibility timeout expires, the msg will then be deleted from the Q.
If not processed in time. The msg will become visible again and another reader will process it.
This CAN result in the same msg being delivered twice/

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

SQS Polling

A
SHORT POLLING - returns immediately even if Q is empty
LONG POLLING (Saves Money) - doesn't return a response until a  message arrive sin the Q or the long poll times out.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

SWF (if TASK is mentioned think SWF)

A

Workflow executions can last up to 1 year
SWF presents a task-oriented API (whereas SQS offers a msg oriented API)
SWF ensures a task is assigned only once and never duplicated
SWF keeps track of all tasks/events in an app (For SQS you need to implement your own tracking)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

SWF

  • WORKFLOW STARTERS
  • DECIDERS
  • ACTIVITY WORKERS
A

Workflow Starters - an app that can initiate a workflow. Could be an e-commerce website following the placement of an order, or a mobile app searching for bus times

DECIDERS - control the flow of activity tasks in a workflow execution. If something has finished/failed in a workflow, a decider decides what to do next.

ACTIVITY WORKERS - carry out the activity tasks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

SNS vs SQS

A

Both are messaging systems
SNS - Push
SQS - Polls (Pulls)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Cross-origin resource sharing (CORS)

A

a mechanism that allows restricted resources (e.g. fonts) on a web page to be requested from another domain outside the domain from which the first resource was served.
CORS is enforced by the client.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

EXAM Q - If you see something along the lines of “Origin policy cannot be read at the remote resource”

A

You need to ENABLE CORS on API Gateway

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Attaching a network interface to an EC2 instance

A

You can attach a network interface to an EC2 instance in the following ways:

1. When it's running (hot attach)
2. When it's stopped (warm attach)
3. When the instance is being launched (cold attach).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

IAM database authentication

A

IAM database authentication is only supported in MySQL and PostgreSQL database engines.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

AWS X-Ray

A

usedto trace and analyze user requests as they travel through your Amazon API Gateway APIs to the underlying services.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Glacier Select

A

is not a storage service.It is primarily used to run queries directly on data stored in Amazon Glacier, retrieving only the data you need out of your archives to use for analytics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

AWS Glue

A

is a fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and load their data for analytics.

17
Q

AWS OpsWorks

A

is a configuration management service that provides managed instances of Chef and Puppet. Chef and Puppet are automation platforms that allow you to use code to automate the configurations of your servers. OpsWorks lets you use Chef and Puppet to automate how servers are configured, deployed, and managed across your Amazon EC2 instances or on-premises compute environments.

18
Q

AWS Trusted Advisor

A

analyzes your AWS environment and provides best practice recommendations in these five categories:
Cost Optimization,
Performance,
Fault Tolerance,
Security,
andService Limits.
You can use a mnemonic, such as CPFSS, to memorize these five categories.

19
Q

Gateway endpoint

A

for Amazon S3 and DynamoDB service, you have to create a gateway endpoint and then use an interface endpoint for other services.

20
Q

How will I be charged if my Spot instance is interrupted?

A

If your Spot instance is terminated or stopped by Amazon EC2 in the first instance hour, you will not be charged for that usage. However, if you terminate the instance yourself, you will be charged to the nearest second. If the Spot instance is terminated or stopped by Amazon EC2 in any subsequent hour,you will be charged for your usage to the nearest second. If you are running on Windows and you terminate the instance yourself, you will be charged for an entire hour.

21
Q

Amazon S3 notification feature

A

Amazon S3 supports the following destinations where it can publish events:

Amazon Simple Notification Service (Amazon SNS) topic -A web service that coordinates and manages the delivery or sending of messages to subscribing endpoints or clients.

Amazon Simple Queue Service (Amazon SQS) queue -Offers reliable and scalable hosted queues for storing messages as they travel between computer.

AWS Lambda -AWS Lambda is a compute service where you can upload your code and the service can run the code on your behalf using the AWS infrastructure. You package up and upload your custom code to AWS Lambda when you create a Lambda function.

22
Q

Data pipeline

A

is primarily used as a cloud-based data workflow service that helps you process and move data between different AWS services and on-premises data sources. It is not suitable for collecting data from distributed sources such as users, IoT devices, or clickstreams.

23
Q

Amazon Redshift Spectrum

A

is a feature of Amazon Redshift that enables you to run queries against exabytes of unstructured data in Amazon S3 with no loading or ETL required.

24
Q

Snowball vs Snowball Edge

A

Although an AWS Snowball device costs less than AWS Snowball Edge, it cannot store 80 TB of data in one device. Take note that the storage capacity is different from the usable capacity for Snowball and Snowball Edge. Remember that an 80 TB Snowball appliance and 100 TB Snowball Edge appliance only have 72 TB and 83 TB of usable capacity respectively. Hence, it would be costly if you use two Snowball devices compared to using just oneAWS Snowball Edge device.

25
Q

DynamoDB auto scaling

A

uses the AWS Application Auto Scaling service to dynamically adjust provisioned throughput capacity on your behalf, in response to actual traffic patterns. This enables a table or a global secondary index to increase its provisioned read and write capacity to handle sudden increases in traffic, without throttling. When the workload decreases, Application Auto Scaling decreases the throughput so that you don’t pay for unused provisioned capacity.

26
Q

Amazon Redshift Enhanced VPC Routing

A

When you use Amazon Redshift Enhanced VPC Routing, Amazon Redshift forces all COPY and UNLOAD traffic between your cluster and your data repositories through your Amazon VPC. By using Enhanced VPC Routing, you can use standard VPC features, such as VPC security groups, network access control lists (ACLs), VPC endpoints, VPC endpoint policies, internet gateways, and Domain Name System (DNS) servers.

27
Q

Redshift Spectrum

A

is primarily used to run queries against exabytes of unstructured data in Amazon S3, with no loading or ETL required.

28
Q

Amazon EMR

A

provides you a managed Hadoop framework that makes it easy, fast, and cost-effective to process vast amounts of data across dynamically scalable Amazon EC2 instances. You CAN ACCESS the operating system of these EC2 instances that were created by Amazon EMR.

29
Q

AWS Step Functions

A

provides serverless orchestration for modern applications. Orchestration centrally manages a workflow by breaking it into multiple steps, adding flow logic, and tracking the inputs and outputs between the steps. As your applications execute, Step Functions maintains application state, tracking exactly which workflow step your application is in, and stores an event log of data that is passed between application components. That means that if networks fail or components hang, your application can pick up right where it left off.

SWF does not provide serverless orchestration to multiple AWS resources

30
Q

CloudTrail

A

By default, CloudTrail event log files are encrypted using Amazon S3 server-side encryption (SSE). You can also choose to encrypt your log files with an AWS Key Management Service (AWS KMS) key. You can store your log files in your bucket for as long as you want. You can also define Amazon S3 lifecycle rules to archive or delete log files automatically. If you want notifications about log file delivery and validation, you can set up Amazon SNS notifications.