S3 Select & Glacier Select Flashcards

1
Q

What is Amazon S3 Select?

A

Amazon S3 Select is a feature that lets you retrieve only a subset of the data in an object stored in S3, using simple SQL expressions. Because the filtering happens within S3, your application has less data to transfer, load, and process.

2
Q

How does Amazon S3 Select improve performance?

A

S3 Select improves performance by allowing applications to retrieve only the specific data they need from an S3 object, which minimizes network bandwidth, reduces the amount of data that needs to be processed, and speeds up query execution.

3
Q

In what scenarios is S3 Select most beneficial?

A

S3 Select is most beneficial in scenarios where you need to quickly access smaller amounts of data from large objects stored in S3, such as log file analysis, data exploration, or filtering subsets of data for processing in big data jobs.

4
Q

How do you use Amazon S3 Select?

A

To use Amazon S3 Select, you issue SQL-based queries directly on S3 objects through the AWS SDKs, the AWS CLI, or the S3 REST API. You specify the object to query and the SQL expression to execute.
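
As a rough illustration of the SDK route, the sketch below uses boto3's `select_object_content` call. It assumes the object is a CSV file with a header row; the helper names `build_select_request`, `collect_records`, and `run_select` are hypothetical and exist only for this example.

```python
from typing import Any, Dict, Iterable


def build_select_request(bucket: str, key: str, sql: str) -> Dict[str, Any]:
    """Build keyword arguments for boto3's S3 select_object_content call.

    Assumption for this sketch: the object is CSV with a header row.
    """
    return {
        "Bucket": bucket,
        "Key": key,
        "ExpressionType": "SQL",
        "Expression": sql,
        "InputSerialization": {"CSV": {"FileHeaderInfo": "USE"}},
        "OutputSerialization": {"CSV": {}},
    }


def collect_records(events: Iterable[Dict[str, Any]]) -> str:
    """Concatenate the Records payloads from the response event stream."""
    chunks = []
    for event in events:
        # Other event types (Stats, Progress, End, ...) carry no row data.
        if "Records" in event:
            chunks.append(event["Records"]["Payload"])
    # Join the bytes first so a character split across chunks decodes cleanly.
    return b"".join(chunks).decode("utf-8")


def run_select(bucket: str, key: str, sql: str) -> str:
    """Run the query against S3. Requires boto3 and AWS credentials."""
    import boto3  # imported lazily so the helpers above work without it

    s3 = boto3.client("s3")
    response = s3.select_object_content(**build_select_request(bucket, key, sql))
    return collect_records(response["Payload"])
```

A call might look like `run_select("my-bucket", "logs/app.csv", "SELECT s.status FROM S3Object s WHERE s.status = '500'")`, where the bucket, key, and column name are placeholders.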

5
Q

What file formats does Amazon S3 Select support?

A

Amazon S3 Select supports CSV, JSON, and Parquet formats. It allows querying these formats directly within S3 without requiring the entire object to be downloaded first.
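
To make the format choice concrete, here is a minimal sketch of the `InputSerialization` value you might pass to boto3's `select_object_content` for each of the three formats. The helper name `input_serialization` is hypothetical, and the chosen options (header row for CSV, line-delimited JSON) are assumptions for illustration.

```python
from typing import Any, Dict


def input_serialization(fmt: str, compression: str = "NONE") -> Dict[str, Any]:
    """Return an InputSerialization dict for CSV, JSON, or Parquet input.

    compression may be "NONE", "GZIP", or "BZIP2" and applies to CSV and JSON;
    this sketch does not set it for Parquet.
    """
    fmt = fmt.upper()
    if fmt == "CSV":
        # Assumes the first line holds column names.
        return {"CSV": {"FileHeaderInfo": "USE"}, "CompressionType": compression}
    if fmt == "JSON":
        # Assumes one JSON object per line; "DOCUMENT" is the other option.
        return {"JSON": {"Type": "LINES"}, "CompressionType": compression}
    if fmt == "PARQUET":
        return {"Parquet": {}}
    raise ValueError(f"unsupported format: {fmt}")
```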

6
Q

What is Amazon Glacier Select?

A

Amazon Glacier Select is a feature that allows querying data archived in Amazon Glacier using standard SQL queries, enabling data retrieval without restoring the entire archive. It’s designed for querying non-urgent, archived data that is infrequently accessed.
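
One way such a query might be submitted is through boto3's S3 `restore_object` call with a `SELECT`-type restore request, which writes the results to an S3 location rather than returning them inline. The sketch below only builds the request and assumes uncompressed CSV input; the helper name `build_glacier_select_request` and all bucket, key, and prefix values are hypothetical.

```python
from typing import Any, Dict


def build_glacier_select_request(
    bucket: str, key: str, sql: str, out_bucket: str, out_prefix: str
) -> Dict[str, Any]:
    """Build keyword arguments for s3.restore_object with a SELECT restore.

    Assumptions for this sketch: CSV input with a header row, CSV output,
    and the Standard retrieval tier.
    """
    return {
        "Bucket": bucket,
        "Key": key,
        "RestoreRequest": {
            "Type": "SELECT",
            "Tier": "Standard",
            "SelectParameters": {
                "InputSerialization": {"CSV": {"FileHeaderInfo": "USE"}},
                "ExpressionType": "SQL",
                "Expression": sql,
                "OutputSerialization": {"CSV": {}},
            },
            # Query results are written under this bucket and prefix.
            "OutputLocation": {
                "S3": {"BucketName": out_bucket, "Prefix": out_prefix}
            },
        },
    }
```

With credentials configured, the request could be sent as `boto3.client("s3").restore_object(**build_glacier_select_request(...))`; the job then completes asynchronously.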

7
Q

How does Amazon Glacier Select differ from S3 Select?

A

While both features query data using SQL expressions, S3 Select operates on active data stored in S3, whereas Glacier Select queries data archived in Glacier. Because Glacier is built for long-term archival with a focus on cost savings, Glacier Select has slower retrieval times than S3 Select.

8
Q

What are the costs associated with using S3 Select and Glacier Select?

A

Costs for using S3 Select and Glacier Select are based on the amount of data scanned by the query, the amount of data returned by the query, and the number of query requests made. These charges are separate from the storage costs of S3 and Glacier.
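
The three cost components can be combined in a small estimate like the one below. This is only a sketch: the function name `estimate_select_cost` is hypothetical, and the rates are caller-supplied placeholders, not actual AWS prices.

```python
def estimate_select_cost(
    scanned_gb: float,
    returned_gb: float,
    requests: int,
    rate_per_gb_scanned: float,
    rate_per_gb_returned: float,
    rate_per_1000_requests: float,
) -> float:
    """Sum the scan, return, and request charges for a set of queries.

    All rates are placeholders supplied by the caller; look up current
    pricing for your region and storage class before relying on a figure.
    """
    scan_cost = scanned_gb * rate_per_gb_scanned
    return_cost = returned_gb * rate_per_gb_returned
    request_cost = (requests / 1000) * rate_per_1000_requests
    return scan_cost + return_cost + request_cost
```

Because scanned and returned data are charged separately, a selective query over a large object still pays for the full scan even when it returns very little.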

9
Q

Can Glacier Select queries run on data archived in Glacier Deep Archive?

A

No, Glacier Select queries can run only on data stored in Glacier, not on data archived in Glacier Deep Archive. Glacier Deep Archive is designed for the lowest cost storage and does not support immediate querying with Glacier Select.

10
Q

What use cases does Glacier Select support?

A

Glacier Select supports use cases involving data analysis, compliance checks, or content discovery on data that isn’t required to be accessed frequently and is archived in Glacier. It’s useful for businesses that need to query archived data for specific information without retrieving the full archive.
