Data Modeling Flashcards
Design a data model, create DAX measures, and optimize performance (31 cards)
What is a data model
A queryable data resource thats optimized for analytics
What are the two analytic query languages?
Data Analysis Expressions (DAX)
Multidimensional Expressions (MDX)
What is a Power BI dataset?
A Power BI model that is published a workspace in the service
Analytic Query
A query against a model that produces a result that easy for a person to understand
Tabular Model
A model that is made up of one or more tables of columns
Star Schema Design
A modeling approach where you have your fact table in the middle and you dimension tables on the points of the star
Table Storage Mode
a property for every table (except calculated tables) that determines if the data is store in the model or left in the source
3 Table Storage Modes
Import
DirectQuery
Dual
Import Storage Mode
Queries data that is cached in the model
DirectQuery Storage Mode
Queries pass through to the data source
Dual Storage Mode
Queries stored data or passes through to the data source. Power BI determines the most efficient plan striving to use cached data whenever possible
What is a model framework?
Its a combination of table storage mode settings
3 Model Frameworks
Import Model
DirectQuery Model
Composite Model
Import Model Framework
A model where the tables have their storage mode property set to Import
DirectQuery Model Framework
A model where tables have their storage mode property set to DirectQuery and they belong to the same Source Group
Composite Model Framework
A model that comprises more than one source group
Source Group
its to combination of the data source and the connection used
What does it mean to have more than one source group?
you have both import and direct query source groups being used or you are using a direct query connection on 2 different data sources
Whats the key difference between an import model framework, direct query model framework, and a composite model framework?
an import model and direct query model only have one source group. When you have more than one source group its a composite model
When is a model considered to be a composite model
When you have both direct query and import data sources or 2 separate direct query data sources
What is the data size limit when publishing to a Shared Capacity?
1GB
What is the data size limit for Dedicated Capacity?
> 10GB
8 main data reduction techniques
- Remove unneeded columns
- Remove unneeded rows
- Summarize fact table to raise grain
- Optimize column data type by using number data type
- Use Power Query for custom columns rather than calculated columns
- Disable Power Query Load
- Disable auto date/time
- Use DirectQuery table storage
How many times can a schedule refresh run in a Shared Capacity?
8 times per day