3A. Describe Power BI Desktop models Flashcards

1
Q

What are the high-level steps to create a data model in Power BI?

A
  • Connect to data
  • Transform and prepare data
  • Define business logic by adding Data Analysis Expressions (DAX) calculations
  • Enforce data permissions with row-level security by adding roles
  • Publish the model to Power BI
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Why is it important to develop optimal data models?

A

It is important for delivering good query performance and for minimizing data refresh times and the use of service resources, including memory and CPU. The fewer resources that are used, the more models that can be hosted and at lower cost.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the default, most optimal, data model?

A

A star schema

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are some benefits to star schemas?

A

It presents a user-friendly structure and it supports high-performance analytic queries.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Describe a star schema, and the types of tables in it, and how are those tables related to each other?

A

It classifies model tables as either fact or dimension. In a diagram, a fact table forms the center of a star, while dimension tables, when placed around a fact table, represent the points of the star.

In the model, dimension tables are related to fact tables by using one-to-many relationships. The relationships allow filters and groups that are applied to dimension table columns to propagate to the fact table. This design pattern is common.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a fact table?

A

Its role is to store an accumulation of rows that represent observations or events that record a specific business activity. For example, events that are stored in a sales fact table could be sales orders and the order lines. You could also use a fact table to record stock movements, stock balances, or daily currency exchange rates. Generally, fact tables contain numerous rows. As time passes, fact table rows accumulate.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a dimension table?

A

It describes your business entities, which commonly represent people, places, products, or concepts. A date dimension table, which contains one row for each date, is a common example of a concept dimension table. The columns in dimension tables allow filtering and grouping of fact table data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What must each dimension table include?

A

Each dimension table must have a unique column, which is referred to as its key column. A unique column doesn’t contain duplicate values and it should never have missing values. In a product dimension table, the column could be named ProductKey or ProductID. Likely, additional columns will store descriptive values, like the product name, subcategory, category, color, and so on.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is an analytic query?

A

A query that produces a result from a semantic model. Each Power BI visual, in the background, submits an analytic query to Power BI to query the model. The analytic query is written as a Data Analysis Expressions (DAX) query statement. However, you don’t need to write a native DAX statement; you only need to configure report visuals by mapping semantic model fields.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are the phases of an analytic query, and in what order are they implemented?

A
  1. Filtering, or slicing, targets the data of relevance. In Power BI reports, filters can be applied to three different scopes: the entire report, a specific page, or a specific visual. Filtering is also applied in the background when row-level security (RLS) is enforced. Each report visual can inherit filters or have filters directly applied to it.
  2. Grouping, or dicing, divides query results into groups.
  3. Summarizing produces a single value result. Typically, numeric columns are summarized by using summarization methods (sum, count, and many others). These methods are simple summarizations. More complex summarizations, like a percent of grand total, can be achieved by defining measures that are written in DAX.

However, not all analytic queries need to filter, group, and summarize:
- Commonly, report visuals are filtered, perhaps by a time period or geographic location.
- Grouping is optional. For example, a card visual, which is used to display a single value, isn’t concerned with grouping.
- Typically, report visuals summarize. One notable exception, however, is the slicer visual, which isn’t concerned with summarization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the standard methodology for adding and configuring visuals in a report?

A
  1. Select a visual type, like a bar chart.
  2. Map semantic model fields, which are displayed in the Fields pane, to the visual field wells. For a bar chart, the wells are Y-axis, X-axis, Legend, Small multiples, and Tooltips.
  3. Configure mapped fields. It’s possible to rename mapped fields or toggle the field to summarize or not summarize. If the field summarizes, you can select the summarization method.
  4. Apply format options, like axis properties, data labels, and many others.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does the word “field” refer to in the context of Power BI reports, and which model resources are fields?

A

Fields is a collective term that is used to describe a model resource that can be used to configure a visual. The three different model resources that are fields include:
- Columns
- Hierarchy levels
- Measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Which type of field/model resource cannot be used to Summarize data in a visual?

A

Hierarchy levels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Which type of field/model resource cannot be used to Group data in a visual, and can only filter data in special cases?

A

Measures

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can you use columns in a visual?

A

To filter, group, and summarize column values. Summarizing numeric columns is common, and it can be done by using sum, count, distinct count, minimum, maximum, average, median, standard deviation, or variance. You can also summarize text columns by using first (alphabetic order), last, count, or distinct count. Additionally, you can summarize date columns by using earliest, latest, count, or distinct count.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How do you use columns to group data in a visual?

A

Set summarization type to “Do not summarize”

17
Q

How can you use hierarchy levels in a visual?

A

To filter and group but not to summarize. Report authors can summarize the column that the hierarchy level is based on, provided that it’s visible in the Fields pane.

18
Q

How can you use measures in a visual?

A

They are designed to summarize model data; they can’t be used to group data, and can be used to filter data in one special case.

To use a measure to filter a visual, the visual needs to display the measure, and the measure needs to be applied as a “visual-level” filter. When used in this way, a measure filter is applied after the analytic query has summarized data. This process is done to eliminate groups where the measure filter condition isn’t true. (For those who are familiar with SQL syntax, a measure that used to filter a visual is like the HAVING clause in a SELECT statement.)