Domain 3: Processing Flashcards

1
Q

What is the main goal of the Processing phase of Data Analytics?

A
  • To transform the data and make it more consumable by the analytics and visualization tools.
  • Choosing the right ETL tool based on workload requirements, performance, cost, and orchestration needs
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the goals of ETL?

A

ETL helps you transform, flatten, and enrich your data from source to target. The goal is to:

  • Handle terabytes and petabytes of data
  • Simplify querying disparate data sets
  • Combine existing and legacy data with modern data sets
  • Prepare data for machine learning
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are some of the issues you face with ETL?

A
  • Data volumes may grow
  • Additional data sources may be added
  • A large proportion of ETL is hand-coded
  • Data formats may change over time
  • Target schemas may change
How well did you know this?
1
Not at all
2
3
4
5
Perfectly