Implement and manage an analytics solution Flashcards
(64 cards)
What is the benefit of storing different layers of your lakehouse in separate workspaces?
It can enhance security, manage capacity use, and optimize cost-effectiveness.
You want to use Apache Spark to explore data interactively in Microsoft Fabric. What should you create?
A notebook.
You need to use Spark to analyze data in a CSV file. What’s the simplest way to accomplish this goal?
Load the file into a dataframe.
Which method is used to split the data across folders when saving a dataframe?
partitionBy
Which type of table should an insurance company use to store supplier attribute details for aggregating claims?
Dimension table.
What is a semantic model in the data warehouse experience?
A semantic model is a business-oriented data model that provides a consistent and reusable representation of data across the organization.
What is the purpose of item permissions in a workspace?
To grant access to individual warehouses for downstream consumption.
Which language is optimized for querying real-time data in an eventhouse?
KQL
Which Microsoft Fabric Real-Time Intelligence component is used to visualize and explore real-time data in tiles?
Real-Time Dashboards
What is the primary function of Microsoft Fabric Eventstreams?
Ingesting and transforming real-time data.
What is the purpose of the Fabric Activator destination in an eventstream?
Data sent to an Activator destination can be used to trigger an automated action based on data values.
What is the primary language used for querying a data warehouse?
SQL
Why is indexing important in a data warehouse?
It speeds up data retrieval times.
What is the purpose of a fact table in a data warehouse?
To store the results of calculations.
What is the primary advantage of Dynamic Data Masking (DDM)?
It limits data exposure by obscuring sensitive information in real time.
What is the purpose of a security predicate function in Row-Level Security (RLS)?
It determines whether a row is accessible to a user based on certain conditions.
What happens when a user is granted a permission and then denied the same permission in a warehouse?
The DENY always supersedes the GRANT, and the user is denied access to the specific object.
What is the role of Git in the CI/CD process in Fabric?
Git lets your team collaborate using branches, and provides version control. It helps manage incremental code changes, and see code history.
What is the purpose of connecting a Fabric workspace to a Git repository?
To sync content between the workspace and Git, ensuring they have the same content.
What is the primary function of deployment pipelines in Fabric?
Deployment pipelines automate the movement of content through the development, test, and production stages.
What are the 3 security levels in Fabric’s security model and their order of evaluation? They are evaluated sequentially to determine whether a user has data access.
- Microsoft Entra ID authentication: checks if the user can authenticate to the Azure identity and access management service, Microsoft Entra ID.
- Fabric access: checks if the user can access Fabric.
- Data security: checks if the user can perform the action they’ve requested on a table or file.
What is order of evaluation of access in Fabric?
Microsoft Entra ID authentication, Fabric access, Data security
What workspace role should be assigned to a data engineer who needs to create Fabric items and read all data in an existing lakehouse?
Contributor
Which of the following tools can be used to apply granular data access permissions in Fabric?
OneLake data access roles