introduction Flashcards

(21 cards)

1
Q

What is a data warehouse?

A

A centralized storage system that integrates data from multiple sources for reporting and analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the key benefits of a data warehouse?

A

Improved decision-making, historical data analysis, better reporting efficiency, and centralized data management.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is OLAP?

A

Online Analytical Processing, used for analyzing data from multiple perspectives for business intelligence.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How does OLAP differ from OLTP?

A

OLAP is for complex analytical queries, while OLTP handles real-time transactional operations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does ETL stand for?

A

Extract, Transform, Load – the process of collecting, cleaning, and storing data in a data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the three types of data models?

A

Conceptual (high-level business concepts), Logical (detailed entities and relationships), Physical (database implementation details).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are the five main OLAP operations?

A

Drill Down, Roll Up, Slice, Dice, and Pivot.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the difference between MOLAP and ROLAP?

A

MOLAP stores data in multidimensional cubes (faster but uses more storage), while ROLAP uses relational databases (scalable but slower for complex queries).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the purpose of dimensional modeling?

A

It organizes data into dimensions (categories) and facts (measurable values) to simplify reporting and analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a Star Schema?

A

A database schema where a central fact table is linked to multiple dimension tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a Snowflake Schema?

A

A variation of the Star Schema where dimension tables are normalized into multiple related tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is Drill Down in OLAP?

A

An operation that provides more detailed views of data, such as viewing sales data at a monthly instead of yearly level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the Roll Up operation in OLAP?

A

The opposite of Drill Down, summarizing data to a higher level, such as viewing yearly instead of monthly sales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the Slice operation in OLAP?

A

Selecting a specific subset of data based on a single dimension (e.g., sales for January only).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the Dice operation in OLAP?

A

Selecting data based on multiple criteria (e.g., sales in January for a specific region).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the Pivot operation in OLAP?

A

Rotating data to change the way dimensions are viewed (e.g., switching from sales by product to sales by region).

17
Q

Why is data integration a challenge in data warehousing?

A

Because data comes from multiple sources with different formats and structures.

18
Q

What are some best practices for data warehousing?

A

Use incremental extraction, ensure data consistency, and optimize query performance.

19
Q

What is the role of metadata in a data warehouse?

A

It provides context and meaning to the stored data, helping users understand data relationships.

20
Q

What are stored and calculated measures in OLAP?

A

Stored measures are precomputed and saved, while calculated measures are computed dynamically during queries.