{ "@context": "https://schema.org", "@type": "Organization", "name": "Brainscape", "url": "https://www.brainscape.com/", "logo": "https://www.brainscape.com/pks/images/cms/public-views/shared/Brainscape-logo-c4e172b280b4616f7fda.svg", "sameAs": [ "https://www.facebook.com/Brainscape", "https://x.com/brainscape", "https://www.linkedin.com/company/brainscape", "https://www.instagram.com/brainscape/", "https://www.tiktok.com/@brainscapeu", "https://www.pinterest.com/brainscape/", "https://www.youtube.com/@BrainscapeNY" ], "contactPoint": { "@type": "ContactPoint", "telephone": "(929) 334-4005", "contactType": "customer service", "availableLanguage": ["English"] }, "founder": { "@type": "Person", "name": "Andrew Cohen" }, "description": "Brainscape’s spaced repetition system is proven to DOUBLE learning results! Find, make, and study flashcards online or in our mobile app. Serious learners only.", "address": { "@type": "PostalAddress", "streetAddress": "159 W 25th St, Ste 517", "addressLocality": "New York", "addressRegion": "NY", "postalCode": "10001", "addressCountry": "USA" } }

Data Engineering Part 3 Flashcards

(15 cards)

1
Q

What is a relational database?

A

A database that stores data in tables with rows and columns using SQL.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is SQL?

A

Structured Query Language, used for managing and querying relational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a primary key?

A

A unique identifier for rows in a table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a foreign key?

A

A reference to a primary key in another table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is normalization in databases?

A

Organizing data to reduce redundancy and improve integrity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a data warehouse?

A

A central repository for structured data used for analysis and reporting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How does a data warehouse differ from a database?

A

Warehouses are optimized for analysis; databases are optimized for transactions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is OLAP?

A

Online Analytical Processing — used for complex analytical queries in warehouses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is OLTP?

A

Online Transaction Processing — used for frequent, simple transactions in operational databases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Name examples of cloud data warehouses.

A

Amazon Redshift, Google BigQuery, Snowflake.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a data lake?

A

A storage system that holds raw structured and unstructured data at scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is a data lake different from a data warehouse?

A

Lakes store raw, unprocessed data; warehouses store processed, structured data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are typical file formats in a data lake?

A

Parquet, Avro, ORC.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is schema-on-read?

A

Schema is applied when the data is read, not when it’s written.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a data swamp?

A

A poorly managed data lake with inaccessible or low-quality data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly