Course 1 | Module 1 | Learning Objective 1 Flashcards

Introduction to Data Engineering (26 cards)

1
Q

What are the various sources of data mentioned?

A

Data resides in:
* text
* images
* videos
* clickstreams
* user conversations
* social media platforms
* IoT devices
* real-time events
* legacy databases
* professional data providers and agencies

2
Q

What is the first step when working with different sources of data?

A

Pull a copy of the data from the original sources into a data repository

3
Q

What challenges are faced when acquiring data?

A

Challenges include reliability, security, and integrity of the data

4
Q

What is the role of Data Engineers?

A

Data Engineers develop and maintain data architectures and make data available for business operations and analysis

5
Q

What tasks do Data Engineers perform?

A

Tasks include:
* Extracting, integrating, and organizing data
* Cleaning, transforming, and preparing data
* Designing, storing, and managing data in repositories

6
Q

What knowledge is required for a Data Engineer?

A

Good knowledge of programming, a sound grasp of systems and technology architectures, and an in-depth understanding of relational and non-relational datastores

7
Q

What is the main responsibility of a Data Analyst?

A

Translate data and numbers into plain language for organizational decision-making

8
Q

What skills are essential for Data Analysts?

A

Skills include:
* Knowledge of spreadsheets
* Writing queries
* Using statistical tools
* Strong analytical and storytelling skills

9
Q

What do Data Scientists analyze?

A

Data Scientists analyze data for actionable insights and build predictive models

10
Q

What knowledge is required for Data Scientists?

A

Knowledge of Mathematics, Statistics, programming languages, databases, and building data models

11
Q

What is the role of Business Analysts?

A

Leverage the work of Data Analysts and Data Scientists to recommend actions based on implications for the business

12
Q

What do Business Intelligence Analysts focus on?

A

Market forces and external influences that shape the business

13
Q

Summarize the relationship between Data Engineering, Data Analytics, and Data Science.

A

Data Engineering converts raw data into usable data; Data Analytics uses that data to generate insights; Data Science uses it to predict future outcomes from past data

14
Q

What do Data Warehouse Engineers do?

A

Design, build, and maintain data warehouses for business intelligence and reporting

15
Q

What is the role of Data Architects?

A

Design the overall architecture for data management systems and define strategies for data integration, governance, and security

16
Q

What responsibilities do Data Managers have?

A

Oversee governance and strategy of data, ensuring quality, compliance, and accessibility

17
Q

What tasks are performed by Database Administrators?

A

Tasks include:
* Conducting routine backups
* Optimizing performance
* Managing security patches
* Monitoring database activity

18
Q

Fill in the blank: The field of Data Engineering concerns itself with the ______.

A

Mechanics for the flow and access of data

19
Q

What is the goal of Data Engineering?

A

To make quality data available for fact-finding and data-driven decision making

20
Q

What does the process of collecting the required data include?

A

Developing tools, workflows, and processes to acquire data from multiple sources

21
Q

What types of data repositories can data be stored in?

A

Data can be stored in:
* Databases
* Data warehouses
* Data lakes
* Other types of data repositories

22
Q

What does processing data involve?

A

Cleaning, transforming, and preparing data so that it is usable
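
As an illustration of this answer, the sketch below cleans, transforms, and prepares a handful of records in plain Python. The record fields (`name`, `amount`), the sample values, and the helper names `clean_record` and `prepare` are assumptions made up for this example; they are not part of the course material.

```python
def clean_record(raw):
    """Return a cleaned copy of one raw record, or None if it is unusable."""
    # Cleaning: trim whitespace and drop records missing a required field.
    name = (raw.get("name") or "").strip()
    if not name:
        return None
    # Cleaning: drop records whose amount cannot be read as a number.
    try:
        amount = float(raw.get("amount", ""))
    except (TypeError, ValueError):
        return None
    # Transforming: normalize the name and convert the amount to whole cents.
    return {"name": name.title(), "amount_cents": round(amount * 100)}


def prepare(records):
    """Preparing: keep only the records that survived cleaning."""
    cleaned = (clean_record(r) for r in records)
    return [r for r in cleaned if r is not None]


# Hypothetical raw input with the kinds of problems cleaning has to handle.
raw_records = [
    {"name": "  alice ", "amount": "12.50"},
    {"name": "", "amount": "3.00"},
    {"name": "bob", "amount": "not a number"},
    {"name": "carol", "amount": 7},
]
usable = prepare(raw_records)
```

Only the first and last records survive here; a real pipeline would usually log or quarantine the rejected ones rather than discard them silently.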

23
Q

Which emerging technology has made it possible for every enterprise to have access to limitless storage and high-performance computing?

A

Cloud Computing

24
Q

A modern data ecosystem includes a network of continually evolving entities. What does it include?

A

Data sources, enterprise data repository, business stakeholders, and tools, applications, and infrastructure to manage data

25
Q

The goal of data engineering is to make quality data available for fact-finding and decision-making. Which statement captures the process of data engineering?

A

Collecting, processing, storing, and making data available to users securely
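
The four stages named in this answer can be sketched end to end with Python's standard `sqlite3` module. This is a minimal illustration under stated assumptions: the hard-coded sensor readings stand in for real data sources, an in-memory SQLite database stands in for the data repository, and the function names (`collect`, `process`, `store`, `average_by_source`) are hypothetical.

```python
import sqlite3


def collect():
    """Collecting: stand-in for pulling raw rows from real sources."""
    return [("sensor-a", " 21.5 "), ("sensor-b", "n/a"), ("sensor-a", "22.0")]


def process(rows):
    """Processing: keep only rows whose value parses as a number."""
    usable = []
    for source, value in rows:
        try:
            usable.append((source, float(value)))
        except ValueError:
            continue  # skip unreadable values such as "n/a"
    return usable


def store(conn, rows):
    """Storing: load the processed rows into a repository table."""
    conn.execute("CREATE TABLE IF NOT EXISTS readings (source TEXT, value REAL)")
    conn.executemany("INSERT INTO readings VALUES (?, ?)", rows)
    conn.commit()


def average_by_source(conn, source):
    """Making data available: a parameterized query avoids SQL injection."""
    row = conn.execute(
        "SELECT AVG(value) FROM readings WHERE source = ?", (source,)
    ).fetchone()
    return row[0]


conn = sqlite3.connect(":memory:")  # assumed repository for this sketch
store(conn, process(collect()))
avg = average_by_source(conn, "sensor-a")
```

Secure access in practice also covers authentication, authorization, and encryption; the parameterized query shows only one small piece of that.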

26
Q

Which three emerging technologies are shaping today’s data ecosystem?

A

Cloud Computing, Machine Learning, and Big Data