AWS Athena Flashcards

1
Q

What are the data formats Athena can work with?

A
  • JSON
  • CSV
  • TSV
  • AVRO
  • ORC
  • PARQUET
2
Q

What is the Avro file format?

A

Avro is a row-based storage format from the Hadoop ecosystem.

3
Q

What is the TSV file format?

A

TSV (tab-separated values) is a plain-text format in which each line is a record and the fields are separated by tab characters. It is commonly used with spreadsheet software.
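
To make the tab-delimited layout concrete, here is a minimal sketch that parses a small TSV string with Python's standard `csv` module. The sample data and the `parse_tsv` helper are made up for illustration and have nothing to do with Athena itself.

```python
import csv
import io

# Hypothetical sample data: a header row plus two records, tab-separated.
SAMPLE_TSV = "id\tname\tcity\n1\tAda\tLondon\n2\tLinus\tHelsinki\n"


def parse_tsv(text):
    """Return one dict per data row, keyed by the names in the header row."""
    reader = csv.DictReader(io.StringIO(text), delimiter="\t")
    return [dict(row) for row in reader]


rows = parse_tsv(SAMPLE_TSV)
```

Each entry in `rows` maps a column name to the string value found between the tabs on that line.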

4
Q

What is the Parquet file format?

A

Parquet is an open-source, columnar file format from the Hadoop ecosystem.

5
Q

What is the ORC file format?

A

The Optimized Row Columnar (ORC) file format provides a highly efficient way to store Hive data.

6
Q

What does Athena enable you to do?

A

It enables you to run SQL queries directly against data stored in S3.
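
As a rough sketch of what running a query looks like from code, the helper below starts a query and polls until it reaches a terminal state. It assumes `client` is a boto3 Athena client (for example `boto3.client("athena")`); the `run_query` name, the polling interval, and any database or bucket names you pass in are illustrative, not part of the original card.

```python
import time


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Start an Athena query and wait until it reaches a terminal state.

    `client` is assumed to be a boto3 Athena client; `output_location` is an
    S3 URI where Athena writes the result files.
    """
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # Poll the execution status until the query finishes one way or another.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return query_id, state
        time.sleep(poll_seconds)
```

A call might look like `run_query(client, "SELECT 1", "my_db", "s3://my-results-bucket/athena/")`, where the database and bucket are placeholders.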

7
Q

What is the Athena data catalogue used for?

A

It is used to store your schema (the view over the data).

8
Q

What is the Athena data catalogue used for?

A

It is used to store the schema (view).

9
Q

How can you save money in Athena?

A

Use compressed file formats, which reduce the amount of data each query has to process.

10
Q

What are you charged for in Athena?

A

S3 storage, plus $5 per TB of data processed by your queries.
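
The per-TB charge can be turned into a quick back-of-the-envelope estimate. The sketch below uses the $5 per TB rate quoted on this card and assumes a binary terabyte (1024^4 bytes); actual pricing varies by region and over time, and the function name is made up for illustration.

```python
PRICE_PER_TB_USD = 5.0    # rate quoted on the card; check current regional pricing
BYTES_PER_TB = 1024 ** 4  # assumption: binary terabyte


def estimate_query_cost(bytes_processed):
    """Estimate the query charge in USD for a given number of bytes processed."""
    return bytes_processed / BYTES_PER_TB * PRICE_PER_TB_USD


# A query over 2 TB of raw data versus the same data compressed to 0.5 TB.
full_scan = estimate_query_cost(2 * BYTES_PER_TB)
compressed = estimate_query_cost(BYTES_PER_TB // 2)
```

Under these assumptions the uncompressed query costs 10.0 USD and the compressed one 2.5 USD, which is why shrinking the data a query has to read lowers the bill.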

11
Q

How do I create the Athena server cluster to run Athena?

A

You do not. Athena is serverless, so there is no cluster to create. This is a question to make sure you are awake.

12
Q

What type of data is more efficient to process in Athena?

A

Columnar-formatted data.
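
A deliberately simplified model shows why columnar layouts help: a query that needs one column only has to read that column's bytes, while a row layout has to read every full row. The table shape, column widths, and function names below are hypothetical, and the model ignores compression, file metadata, and partitioning.

```python
# Hypothetical table: column name -> bytes per value.
COLUMN_WIDTHS = {"id": 8, "name": 32, "email": 64, "amount": 8}
ROW_COUNT = 1_000_000


def bytes_read_row_layout(needed_columns):
    """Row layout: whole rows are read, whichever columns the query needs."""
    return ROW_COUNT * sum(COLUMN_WIDTHS.values())


def bytes_read_column_layout(needed_columns):
    """Columnar layout: only the columns the query references are read."""
    return ROW_COUNT * sum(COLUMN_WIDTHS[name] for name in needed_columns)


row_bytes = bytes_read_row_layout(["amount"])
column_bytes = bytes_read_column_layout(["amount"])
```

In this toy example a query over `amount` alone reads 112,000,000 bytes in the row layout and 8,000,000 bytes in the columnar one.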

13
Q

What is the query language used in Athena?

A

SQL

14
Q

How can I easily query geospatial information?

A

Many public data sets are available, covering areas such as health, geospatial, and weather data. This data is often already available in S3, or sometimes needs to be copied there. You can then use Athena to query it.

15
Q

How do I create a schema table in Athena?

A

You run a SQL query to create the tables, and then you can run other SQL queries over these tables.
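
The table-creating query is a `CREATE EXTERNAL TABLE` statement that points at a location in S3. The sketch below only assembles such a statement as a string; the database, table, columns, and bucket are hypothetical, and the helper does no escaping or validation of its inputs.

```python
def build_create_table(database, table, columns, location, stored_as="PARQUET"):
    """Build a CREATE EXTERNAL TABLE statement for data that already sits in S3.

    `columns` is a list of (name, type) pairs. Inputs are not sanitized.
    """
    column_lines = ",\n".join(f"  `{name}` {col_type}" for name, col_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {database}.{table} (\n"
        f"{column_lines}\n"
        f")\n"
        f"STORED AS {stored_as}\n"
        f"LOCATION '{location}'"
    )


# Hypothetical names, for illustration only.
ddl = build_create_table(
    "sales_db",
    "orders",
    [("order_id", "bigint"), ("customer", "string"), ("total", "double")],
    "s3://example-bucket/orders/",
)
```

Once a statement like this has been run, ordinary `SELECT` queries can refer to `sales_db.orders`.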

16
Q

I want to query huge amounts of CloudTrail logs. How can I do this?

A

Store the CloudTrail logs in S3 and use Athena to query the logs.

17
Q

Will Athena modify the source data?

A

No. It creates a schema and runs queries through that schema in a read-only way.

18
Q

Does Athena support views of the data?

A

Yes, support for views was added to Athena.

19
Q

Are Athena views materialized?

A

No, the view's underlying query is run each time the view is queried.
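
The behaviour of a non-materialized view can be seen in any SQL engine that works the same way. The sketch below uses Python's built-in `sqlite3` module as a stand-in, not Athena itself: the view stores only its defining query, so rows inserted after the view was created show up the next time it is queried. The table and view names are made up.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, total REAL)")
conn.execute(
    "CREATE VIEW big_orders AS SELECT id, total FROM orders WHERE total > 100"
)

conn.execute("INSERT INTO orders VALUES (1, 50.0), (2, 150.0)")
before = conn.execute("SELECT id FROM big_orders ORDER BY id").fetchall()

# The view holds no data of its own, so a new row appears on the next query.
conn.execute("INSERT INTO orders VALUES (3, 300.0)")
after = conn.execute("SELECT id FROM big_orders ORDER BY id").fetchall()
```

Here `before` contains only order 2, while `after` contains orders 2 and 3, because the view's query is re-evaluated on each read.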