Advanced Amazon S3 Flashcards

1
Q

You can transition ________ between storage classes

A

objects

  • For infrequently accessed object, move them to standard IA
  • For archive objects that you don’t need fast access to, move them to glacier or glacier deep archive
  • Moving object can be automated using a lifecycle rules
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are some features of Amazon S3 - Lifecycle rules?

A
  1. Transition actions: configure object to transition to another storage class
    - Move objects to standard IA class 60 days after creations
    - Move to glacier for archiving after 6 months
  2. Expiration actions: configure object to expire (delete) after some time
  • Access log files can be set to delete after a 365 days
  • Can be used to delete old version of files (if versioning is enabled)
  • Can be use to delete incomplete multi part uploads
  • Rules can be create for a certain prefix (ex. S3://mybucket/MP3’s/*)
  • Rules can be created for certain objects tags (ex. Department finance)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

__________ help you decide when to transition objects to the right storage class

A

Amazon S3 analytics - Store Class analysis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some features of Amazon S3 Analytics- storage class analysis

A
  • Recommendations for standard & standard IA
  • Does not work for one-zone IA or glacier
  • Report is updated daily
  • 24-48 hours to start seeing data analysis
  • Good first step to put together lifecycle rules (or improve them)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are some features of S3- Requester Pays?

A
  1. In General bucket owners pay for all Amazon S3 storage & data transfer costs associated with their bucket
  2. With requester pays buckets the requester instead of the bucket owner pays the cost of the request and the data download from the bucket
  3. Helpful when you want to share large datasets with other accounts
  4. The requester must be authenticated in AWS (can’t be anonymous)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are some features of S3 event notifications ?

A
  1. S3: Objectcreated, S3: objectremoved
    - S3:Objectrestore, S3: replication
  2. Object Name filtering possible (*.jpg)
  3. Use case: generate thumbnails of images uploaded to S3
  4. Can create as many “S3 events” as desired
  5. S3 Event notifications typically deliver events in seconds but can sometimes take a minute or longer
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

For S3 events notifications to work, need to have __________

A

IAM permissions (& need to attach an SNS resource (access) IAM policy) & if you use SQS need to have SQS resource (access) IAM policy & if you use a lambda then need to have lambda resource policy

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Having ___________ with S3 event notification can give you advanced filtering options with JSON rules

A

Amazon EventBridge

Also get: Multi destinations: ex step functions, kinesis streams/ firehose

EventBridge capabilities
- archive, replay events reliable delivery

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Why are some features of S3 baseline performance ?

A
  1. Amazon S3: automatically scales to high request rates, latency 100-200ms
  2. Your application can archive at least 3,500 PUT/COPY/POST/DELETE or 5,500’GET/HEAD request per second per prefix in a bucket
  • There is no limit to the number of prefixes in a bucket
  • So if you spread reads across all four prefixed evenly, you can achieve 22,000 request per second for GET and HEAD
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are some ways to optimize S3 performance?

A
  1. Multi Part Upload:
    - recommend for files > 100MB
    must use for files > 5GB
    - Can help parallelize uploads (speed up transfers)
  2. S3 Transfer acceleration
    - Increase transfers speed by transferring file to an AWS edge location which will forward the data to the S3 bucket in the target region
  • Compatible with multi-part upload
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How can S3 performance read /get files in the most efficient way?

A

Can use S3 Byte - Range fetches
- Parallele GETs by requesting specific byte ranges

  • Better resilience in case of failures
  • Can be used to speed up downloads
  • Can be used to retrieve only partial data (for example the head of a file)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are some features of S3 select & glacier select?

A
  1. Retriever less data using SQL by performing sever-side filtering
  2. Can filter by rows & columns (simple SWL statements)
  3. Less network, transfer less CPU cost client side
How well did you know this?
1
Not at all
2
3
4
5
Perfectly