End-to-End Analytics (Fabric) Flashcards
(56 cards)
What is OneLake?
Fabric’s lake-centric architecture that provides a single, integrated environment for data professionals and the business to collaborate on data projects.
What is OneCopy?
A key component of OneLake that allows you to read data from a single copy, without moving or duplicating data.
How does OneLake facilitate collaboration?
By eliminating the need to move and copy data between different systems and teams.
What is the analogy used to describe OneLake?
Think of it like OneDrive for data.
What does OneLake combine?
Storage locations across different regions and clouds into a single logical lake.
What features does Fabric’s data warehousing, data engineering, integration, intelligence, and Power BI use?
They all use OneLake as their native store without needing any extra configuration.
On what is OneLake built?
Azure Data Lake Storage (ADLS).
What formats can data be stored in OneLake?
Delta, Parquet, CSV, JSON, and more.
What is the benefit of storing data in OneLake?
Data stored in OneLake is directly accessible by all compute engines without needing to be moved or copied.
What format do analytical engines in Fabric write for tabular data?
Delta-parquet format.
What is a key feature of OneLake regarding shortcuts?
Shortcuts are embedded references within OneLake that point to other files or storage locations.
What does the Synapse Data Engineering experience focus on?
Data engineering with a Spark platform for data transformation at scale.
What is the purpose of the Synapse Data Warehouse experience?
Data warehousing with industry-leading SQL performance and scale to support data use.
What does the Synapse Data Science experience utilize?
Azure Machine Learning and Spark for model training and execution tracking in a scalable environment.
What is the focus of Synapse Real-Time Intelligence?
Real-time intelligence to query and analyze large volumes of data in real-time.
What does the Data Factory experience combine?
Power Query with the scale of Azure Data Factory to move and transform data.
What is the role of Power BI in Fabric?
Business intelligence for translating data to decisions through interactive reports.
What is the function of workspaces in Microsoft Fabric?
They serve as logical containers that help you organize and manage your data, reports, and other assets.
How do workspaces support collaboration?
By providing a clear separation of resources, making it easier to control access and maintain security.
What can be customized within workspaces?
How to separate and control access to the items.
What features do workspaces support for managing compute resources?
Settings to manage compute resources and integrate with Git for version control.
What does Git integration in workspaces allow?
To track changes, collaborate on code, and maintain a history of your work.
What is the significance of data lineage and impact analysis in workspaces?
They provide a comprehensive view of data flow and dependencies, enhancing transparency and decision-making.
How is security managed in OneLake?
Data is secured and governed in one place, allowing users to easily find and access the data they need.