3H. Optimize a model for performance in Power BI Flashcards
How do report users experience a model that’s performing poorly?
From a report user's perspective, poor performance means report pages that take longer to load and visuals that take more time to update. This poor performance results in a negative user experience.
What are the two most common reasons for poor model performance?
As a data analyst, you will spend approximately 90 percent of your time working with your data, and nine times out of ten, poor performance is a direct result of a bad semantic model, bad Data Analysis Expressions (DAX), or a mix of the two.
Why is it good to design a good semantic model from the very start, even if it might take a bit longer?
If you address performance issues during development, you will have a robust Power BI semantic model that will return better reporting performance and a more positive user experience. Ultimately, you will also be able to maintain optimized performance. As your organization grows, the size of its data grows, and its semantic model becomes more complex. By optimizing your semantic model early, you can mitigate the negative impact that this growth might have on the performance of your semantic model.
What’s usually the biggest factor impacting model performance, and how can you control it?
A smaller semantic model uses fewer resources (memory) and achieves faster data refresh, calculations, and rendering of visuals in reports. Therefore, the performance optimization process involves minimizing the size of the semantic model and making the most efficient use of the data in the model, which includes:
- Ensuring that the correct data types are used.
- Deleting unnecessary columns and rows.
- Avoiding repeated values.
- Replacing numeric columns with measures (see the sketch after this list).
- Reducing cardinalities.
- Analyzing model metadata.
- Summarizing data where possible.
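As an illustration of replacing numeric columns with measures: rather than storing a pre-computed line total in every row, you can compute it at query time. A minimal sketch, assuming a Sales table with Quantity and Unit Price columns (hypothetical names):

    Sales Amount =
    SUMX ( Sales, Sales[Quantity] * Sales[Unit Price] )

Because the measure is evaluated only when a visual queries it, the model no longer has to store the pre-multiplied column, which reduces its size.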
What is a good first step to identifying bottlenecks in model performance?
You can use Performance analyzer in Power BI Desktop to help you find out how each of your report elements is performing when users interact with them.
What does Performance analyzer do?
With it, you can determine how long it takes for a particular visual to refresh when it is initiated by a user interaction. Performance analyzer will help you identify the elements that are contributing to your performance issues, which can be useful during troubleshooting.
What do you have to do before running Performance analyzer?
You have to clear the visual cache and the data engine cache.
To do this, create a blank report page, select it, save and close the file, and re-open the file (now on the blank page).
How do you use Performance analyzer?
From a blank page, click Performance analyzer and select Start recording. Then move to the page you want to analyze, and interact with the elements of the report that you want to measure. When finished, click Stop.
For which tasks involved in loading a given visual does Performance analyzer record the time, and what are those tasks?
- DAX query - The time it took for the visual to send the query, along with the time it took Analysis Services to return the results.
- Visual display - The time it took for the visual to render on the screen, including the time required to retrieve web images or geocoding.
- Other - The time it took the visual to prepare queries, wait for other visuals to complete, or perform other background processing tasks. If this category displays a long duration, the only real way to reduce this duration is to optimize DAX queries for other visuals, or reduce the number of visuals in the report.
If visuals are the bottleneck in model performance, how can you address this?
Consider the number of visuals on the report page; fewer visuals means better performance. Ask yourself if a visual is really necessary and if it adds value to the end user. If the answer is no, you should remove that visual. Rather than using multiple visuals on the page, consider other ways to provide additional details, such as drill-through pages and report page tooltips.
Examine the number of fields in each visual. The more fields a visual uses, the higher the chance of performance issues; in addition, visuals with many fields can appear crowded and lose clarity. The upper limit for a visual is 100 fields (measures or columns), so a visual with more than 100 fields will be slow to load. Ask yourself if you really need all of this data in a visual. You might find that you can reduce the number of fields that you currently use.
If a DAX query is the bottleneck in model performance, how can you address this, and what is a good benchmark for determining if a query is too slow?
A good starting point is any DAX query that is taking longer than 120 milliseconds.
To shorten query time, try rewriting the measure with different functions.
What might be a more resource efficient alternative to FILTER?
KEEPFILTERS(). Used inside CALCULATE, KEEPFILTERS adds a filter that is intersected with the existing filter context, so filters already applied to the visual are preserved. FILTER, by contrast, removes the context filters and iterates again over all rows of a table, requiring more calculation when filters are already applied separately to the visual.
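A sketch of the contrast, assuming a Sales table with a Currency column and an existing [Total Sales] measure (hypothetical names):

    -- Removes any existing filter on Currency and scans all of its values again:
    Sales USD =
    CALCULATE ( [Total Sales], FILTER ( ALL ( Sales[Currency] ), Sales[Currency] = "USD" ) )

    -- Intersects "USD" with whatever Currency filters the visual already applies:
    Sales USD (keep filters) =
    CALCULATE ( [Total Sales], KEEPFILTERS ( Sales[Currency] = "USD" ) )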
How might relationships affect model performance?
You should review the relationships between your tables to ensure that you have established the correct relationships. Check that relationship cardinality properties are correctly configured. For example, a one-side column that contains unique values might be incorrectly configured as a many-side column.
How might columns affect model performance?
It is best practice to not import columns of data that you do not need. Rather than deleting unneeded columns in Power Query Editor, try to deal with them at the source when loading data into Power BI Desktop. However, if it is impossible to remove redundant columns from the source query, or the data has already been imported in its raw state, you can always use Power Query Editor to examine each column. Ask yourself if you really need each column and try to identify the benefit that each one adds to your semantic model. If you find that a column adds no value, you should remove it from your semantic model.
When you remove an unnecessary column, you will reduce the size of the semantic model which, in turn, results in a smaller file size and faster refresh time. Also, because the semantic model contains only relevant data, the overall report performance will be improved.
What is metadata and how can it affect model performance?
Metadata is information about other data. Power BI metadata contains information on your semantic model, such as the name, data type and format of each of the columns, the schema of the database, the report design, when the file was last modified, the data refresh rates, and much more.
When you load data into Power BI Desktop, it is good practice to analyze the corresponding metadata so you can identify any inconsistencies with your semantic model and normalize the data, or spot errors, incorrect data types, unexpected amounts of data, and so on, before you start to build reports.
You can also look at the pbix metadata to see the size of your model.
What is the auto date/time feature, and how can it impact performance?
Another item to consider when optimizing performance is the Auto date/time option in Power BI Desktop. By default, this feature is enabled globally, which means that Power BI Desktop automatically creates a hidden calculated table for each date column, provided that certain conditions are met. The new, hidden tables are in addition to the tables that you already have in your semantic model.
The Auto date/time option allows you to work with time intelligence when filtering, grouping, and drilling down through calendar time periods. We recommend that you keep the Auto date/time option enabled only when you work with calendar time periods and when you have simplistic model requirements in relation to time.
If your data source already defines a date dimension table, that table should be used to consistently define time within your organization, and you should disable the global Auto date/time option. Disabling this option can lower the size of your semantic model and reduce the refresh time.
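If you disable Auto date/time and your source does not provide a date dimension, one common alternative (a sketch, not part of the card) is a dedicated DAX date table that you then mark as a date table:

    Date =
    ADDCOLUMNS (
        CALENDARAUTO (),
        "Year", YEAR ( [Date] ),
        "Month", FORMAT ( [Date], "MMM YYYY" )
    )

A single shared date table replaces the many hidden tables that Auto date/time would otherwise create, one per date column.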
What are the advantages to using variables when writing DAX formulas?
- Improved performance - Variables can make measures more efficient because they remove the need for Power BI to evaluate the same expression multiple times. You can achieve the same results in a query in about half the original processing time.
- Improved readability - Variables have short, self-describing names and are used in place of an ambiguous, multi-worded expression. You might find it easier to read and understand the formulas when variables are used.
- Simplified debugging - You can use variables to debug a formula and test expressions, which can be helpful during troubleshooting.
- Reduced complexity - Variables do not require the use of the EARLIER or EARLIEST DAX functions, which are difficult to understand. These functions were required before variables were introduced and were used in complex expressions that introduced new row contexts. Now that you can use variables instead of those functions, you can write less complex formulas.
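A sketch of such a measure (the [Sales] measure and 'Date' table are assumed names), where the variable ensures the prior-year value is computed once rather than twice:

    Sales YoY Growth % =
    VAR SalesPriorYear =
        CALCULATE ( [Sales], PARALLELPERIOD ( 'Date'[Date], -12, MONTH ) )
    RETURN
        DIVIDE ( [Sales] - SalesPriorYear, SalesPriorYear )

Without the variable, the CALCULATE expression would appear, and be evaluated, in both the numerator and the denominator.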
What is best practice when using variables to improve readability?
When using variables, it is best practice to use descriptive names for the variables. In the previous example, the variable is called SalesPriorYear, which clearly states what the variable is calculating. Consider the outcome of using a variable that was called X, temp or variable1; the purpose of the variable would not be clear at all.
Using clear, concise, meaningful names will help make it easier for you to understand what you are trying to calculate, and it will be much simpler for other developers to maintain the report in the future.
What is cardinality of columns, and how can it impact performance?
Cardinality refers to the number of unique values in a column. A column that has a lot of repeated values (a low unique count) has a low level of cardinality; conversely, a column that has a lot of unique values (a high unique count) has a high level of cardinality.
Lower cardinality leads to more optimized performance, so you might need to reduce the number of high-cardinality columns in your semantic model.
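To gauge a column's cardinality, you can count its distinct values, for example with a DAX query run from DAX query view (the table and column names are assumptions):

    EVALUATE
    ROW ( "OrderID cardinality", DISTINCTCOUNT ( Sales[OrderID] ) )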
What is another word for dimension table?
Lookup table
What column settings could cause a relationship between two tables to break a model?
Always ensure that the two columns participating in a relationship share the same data type. Your model will never work correctly if you build a relationship between a column with a text data type and a column with an integer data type, even though Power BI Desktop does let you create relationships between columns of different data types.
Which data type is better for performance when it comes to columns that create relationships between tables?
Columns with an integer data type perform better than columns with a text data type.
Why might it be worthwhile to try to reduce the cardinality (in this case, read: granularity) of a model even if the data set currently looks small?
You might expect it to grow over time.
What is perhaps the most effective technique to reduce model size (by reducing cardinality/granularity)?
Using a summary table from the data source.
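A sketch of what such a summary looks like, written as a DAX calculated table purely for illustration (table and column names are assumptions); in practice you would build the equivalent GROUP BY at the source so the detail rows are never imported:

    Sales Summary =
    SUMMARIZECOLUMNS (
        Sales[OrderDate],
        Sales[ProductKey],
        "Total Quantity", SUM ( Sales[Quantity] ),
        "Total Sales", SUM ( Sales[SalesAmount] )
    )

Grouping by date and product in this way collapses many detail rows into one row per combination, which is where the size reduction comes from.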