Course 1| Module 1| Learning Objective 1 Flashcards
Introduction to Data Engineering (26 cards)
What are the various sources of data mentioned?
Data resides in:
* text
* images
* videos
* clickstreams
* user conversations
* social media platforms
* IoT devices
* real-time events
* legacy databases
* professional data providers and agencies
What is the first step when working with different sources of data?
Pull a copy of the data from the original sources into a data repository
What challenges are faced when acquiring data?
Challenges include reliability, security, and integrity of the data
What is the role of Data Engineers?
Data Engineers develop and maintain data architectures and make data available for business operations and analysis
What tasks do Data Engineers perform?
Tasks include:
* Extracting, integrating, and organizing data
* Cleaning, transforming, and preparing data
* Designing, storing, and managing data in repositories
What knowledge is required for a Data Engineer?
Good knowledge of programming, systems and technology architectures, and understanding of relational and non-relational datastores
What is the main responsibility of a Data Analyst?
Translate data and numbers into plain language for organizational decision-making
What skills are essential for Data Analysts?
Skills include:
* Knowledge of spreadsheets
* Writing queries
* Using statistical tools
* Strong analytical and storytelling skills
What do Data Scientists analyze?
Data Scientists analyze data for actionable insights and build predictive models
What knowledge is required for Data Scientists?
Knowledge of Mathematics, Statistics, programming languages, databases, and building data models
What is the role of Business Analysts?
Leverage the work of Data Analysts and Data Scientists to recommend actions based on implications for the business
What do Business Intelligence Analysts focus on?
Market forces and external influences that shape the business
Summarize the relationship between Data Engineering, Data Analytics, and Data Science.
Data Engineering converts raw data into usable data; Data Analytics generates insights; Data Scientists predict future outcomes using past data
What do Data Warehouse Engineers do?
Design, build, and maintain data warehouses for business intelligence and reporting
What is the role of Data Architects?
Design the overall architecture for data management systems and define strategies for data integration, governance, and security
What responsibilities do Data Managers have?
Oversee governance and strategy of data, ensuring quality, compliance, and accessibility
What tasks are performed by Database Administrators?
Tasks include:
* Conducting routine backups
* Optimizing performance
* Managing security patches
* Monitoring database activity
Fill in the blank: The field of Data Engineering concerns itself with the ______.
[mechanics for the flow and access of data]
What is the goal of Data Engineering?
To make quality data available for fact-finding and data-driven decision making
What includes the process of collecting required data?
Developing tools, workflows, and processes to acquire data from multiple sources
What types of data repositories can data be stored in?
Data can be stored in:
* Databases
* Data warehouses
* Data lakes
* Other types of data repositories
What does processing data involve?
Cleaning, transforming, and preparing data so that it is usable
Which emerging technology has made it possible for every enterprise to have access to limitless storage and high-performance computing?
Cloud Computing
A modern data ecosystem includes a network of continually evolving entities. It includes?
Data sources, enterprise data repository, business stakeholders, and tools, applications, and infrastructure to manage data