Delta Live Tables Flashcards

1
Q

What is Delta Live Tables?

A

A managed data pipeline framework exclusive to Databricks
1/An ETL tool for data warehousing
2/Managed Structured Streaming for Spark

2
Q

What problem does Delta Live Tables solve?

A

1/Write pipelines in simple, declarative code
2/Easily implement and monitor data quality constraints
3/Easily track and view data lineage
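
The data quality point can be illustrated with a toy model. In the DLT Python API, rules are attached to a table with decorators such as `@dlt.expect_or_drop`, but the `dlt` module is only available inside a Databricks pipeline, so the sketch below does not call it. It is plain Python that mimics the idea: named rules are declared once, rows failing any rule are dropped, and failures are counted per rule for monitoring. The function `apply_expectations`, the rule names, and the sample rows are all hypothetical.

```python
from typing import Any, Callable, Dict, List, Tuple

Row = Dict[str, Any]
Rule = Callable[[Row], bool]


def apply_expectations(
    rows: List[Row], rules: Dict[str, Rule]
) -> Tuple[List[Row], Dict[str, int]]:
    """Keep rows that pass every rule and count failures per rule name."""
    failures = {name: 0 for name in rules}
    kept = []
    for row in rows:
        passed_all = True
        for name, rule in rules.items():
            if not rule(row):
                failures[name] += 1
                passed_all = False
        if passed_all:
            kept.append(row)
    return kept, failures


# Hypothetical sample data and rule names, for illustration only.
orders = [
    {"order_id": 1, "amount": 25.0},
    {"order_id": None, "amount": 10.0},
    {"order_id": 3, "amount": -5.0},
]
rules = {
    "valid_order_id": lambda r: r["order_id"] is not None,
    "positive_amount": lambda r: r["amount"] > 0,
}

clean_orders, failure_counts = apply_expectations(orders, rules)
print(clean_orders)
print(failure_counts)
```

The point of the declarative style is that the rules live next to the table definition rather than in separate validation jobs, and the per-rule counts are what a monitoring view would surface.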

3
Q

Why do customers care about Delta Live Tables?

A

It accelerates data pipeline development and reduces the burden of performance tuning and maintenance

4
Q

When should you position Delta Live Tables with a customer?

A

Whenever you see data pipeline/ETL work in Databricks, whether batch or streaming

5
Q

How does Delta Live Tables work?

A

1/Customers write DLT-specific SQL/Python code in a notebook
2/Once written, switch to Workflows, click DLT, and set up a cluster to run the notebook
3/Once run, DLT populates the UI with a data lineage diagram and writes out logs that are used for monitoring
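
To make step 3 more concrete, here is a minimal sketch of how a lineage view and a run order can be derived once each table declares which tables it reads from. This is a toy model in plain Python using the standard library's `graphlib`, not a description of how DLT is implemented internally. The table names and the helpers `run_order` and `downstream_of` are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each table maps to the set of tables it reads from.
table_sources = {
    "raw_orders": set(),
    "clean_orders": {"raw_orders"},
    "daily_revenue": {"clean_orders"},
    "top_customers": {"clean_orders"},
}


def run_order(sources):
    """Return an order in which every table comes after the tables it reads."""
    return list(TopologicalSorter(sources).static_order())


def downstream_of(table, sources):
    """Return all tables that directly or transitively read from `table`."""
    found = set()
    frontier = [table]
    while frontier:
        current = frontier.pop()
        for name, deps in sources.items():
            if current in deps and name not in found:
                found.add(name)
                frontier.append(name)
    return found


order = run_order(table_sources)
affected = downstream_of("raw_orders", table_sources)
print(order)
print(sorted(affected))
```

Because dependencies are declared rather than scheduled by hand, the same information can drive both the execution order and the lineage diagram.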

6
Q

What are key features of Delta Live Tables?

A

1/Automatic table optimization
2/Enhanced autoscaling for spiky workloads
3/Easily create test pipelines

7
Q

True or false: DLT is only for streaming

A

False: A DLT pipeline runs in either continuous or triggered mode, so it covers both streaming and batch workloads
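
As a rough illustration, the choice between the two modes is a pipeline-level setting rather than a change to the pipeline code. The sketch below builds two settings documents that differ only in a `continuous` flag. The field names follow the DLT pipeline settings JSON as I understand it and should be checked against the current Databricks documentation. The helper `pipeline_settings`, the pipeline names, and the notebook path are hypothetical.

```python
import json


def pipeline_settings(name: str, notebook_path: str, continuous: bool) -> dict:
    """Build a minimal pipeline settings document (assumed field names)."""
    return {
        "name": name,
        "libraries": [{"notebook": {"path": notebook_path}}],
        # False: triggered mode, process available data and then stop.
        # True: continuous mode, keep running and process data as it arrives.
        "continuous": continuous,
    }


# Same notebook, two execution modes (names and path are hypothetical).
triggered = pipeline_settings("orders_batch", "/Pipelines/orders", continuous=False)
streaming = pipeline_settings("orders_stream", "/Pipelines/orders", continuous=True)

print(json.dumps(triggered, indent=2))
print(json.dumps(streaming, indent=2))
```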

8
Q

True or false: DLT is low-code data engineering and not for advanced Spark users

A

False: DLT improves efficiency for all users. It keeps pipeline code simple and handles optimization, testing, error handling, monitoring, and documentation

9
Q

What should you look for when pitching DLT?

A

1/Pipelines with unpredictable workloads
2/Complex pipelines with many downstream tables
3/Mention of data lineage and/or quality

10
Q

Competitors?

A

dbt, and any Structured Streaming solution

11
Q

Cost

A

It costs more than the alternatives but performs better. Pricing starts at $0.20 per DBU for the Core edition
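
A quick back-of-the-envelope calculation shows how the per-DBU rate turns into a bill. The sketch below uses the $0.20 Core rate from this card. The DBU consumption figure is a made-up example, and the result covers only the DBU charge, on the assumption that cloud infrastructure is billed separately. The helper `dlt_dbu_cost` is hypothetical, and current pricing should be confirmed before quoting it.

```python
from decimal import Decimal

# Rate taken from this card; verify against current Databricks pricing.
CORE_RATE_PER_DBU = Decimal("0.20")


def dlt_dbu_cost(dbus_consumed, rate_per_dbu=CORE_RATE_PER_DBU):
    """Return the DBU charge in dollars: DBUs consumed times the per-DBU rate."""
    return Decimal(str(dbus_consumed)) * rate_per_dbu


# Hypothetical usage: a pipeline that consumes 150 DBUs in a month.
monthly_cost = dlt_dbu_cost(150)
print(f"${monthly_cost:.2f}")
```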
