Data Integration and Modelling Flashcards

(11 cards)

1
Q

What are the components of a Conceptual Model in a Data Warehouse?

A

Facts, Measures and Dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Define Facts, Measures and Dimensions

A

Facts: Focus of interest for decision making.
E.g Sales, Shipments

Measures: Attributes that describe facts from different points of view.
E.g. Each sale is measured by its revenue.

Dimensions: Discrete attributes which determine the granularity adopted to represent facts
E.g. Product, Store, Date

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the purpose of a Multidimensional E/R Model?

A

Its purpose is to create an intuitive representation
of the multidimensional data that is optimized for
high-performance access

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How are dimension hierarchies organised?

A

Dependencies between the classification levels are described by the classification schema through functional dependencies.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the advantages of using a Star Schema?

A

Improved Query performance for often-used data
Less tables and simple structure
Efficient query processing with regard to dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a data cube?

A

Cubes consist of data cells with one or more dimensions.
The coordinates of a cell are the classification nodes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are some of the properties of dimension tables?

A
  1. Contain many descriptive columns: dimension tables are wide.
  2. Generally don’t have too many rows.
  3. Contents are relatively static: almost like a lookup table.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What are some of the uses of dimension tables?

A

Some of the uses of dimension tables include:
1. Filters are based on dimension attributes.
2. Grouping columns are dimension attributes.
3. Fact tables are references through dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the advantages and disadvantages of using a snowflake schema?

A

The main advantage of using a snowflake schema is that the queries can run faster, as the size of the dimension tables can be reduced.

Disadvantages:
Fact tables are responsible for 90% of the storage requirements. Thus, normalising the dimensions usually lead to insignificant improvements.

Normalisation of the dimension-tables can reduce the performance of the DW because it leads to a large number of tables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are the advantages and disadvantages of using a star schema?

A

The main advantages of using a star schema are:
- Improved query performance for often-used data
- Less tables and simple structure
- Efficient query processing with regard to dimensions.

The main disadvantage is:
- High overhead of redundant data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a star schema and what are the advantages/disadvantages of using this model?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly