Chapter 2 - Normalization Flashcards

1
Q

Normalization

A

RAOC

  • The process of organizing tables in a relational database
  • Eliminates data redundancy
  • Decreases record locking
  • Increases efficiency in concurrency
  • Reduces data anomalies
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Characteristics of a Normalized Database

A

DEIL

  1. detailed data
  2. data efficiently
  3. data integrity
  4. Results in a lot of tables
  5. Excellent for managing our day to day activities
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Drawbacks of a Normalized Database

A
  • Inefficient for data extraction, poor data analysis (Because it is stored in the most detailed format)
  • stored in multiple databases more data is added queries become even more inefficient
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Dimensional Modeling

A

data warehouse databases are designed using the Dimensional Model while Relational databases (OLTP) are designed using the Entity Relationship (ER) model

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

star schema

A
  1. A single central table called a fact table
  2. Fact table is surrounded by multiple tables called dimensions
  3. A fact table represents a multi dimensional hypercube
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Shared or Conformed dimensions

A

Dimensions with connections to multiple fact tables are called shared or conformed
dimensions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Snowflake vs Star schema

A

If you imagine multiple dimensions brought to 3NF with a central fact table you
end up with a Snowflake schema. If you take a Star schema and normalize it, you end up with a
Snowflake schema.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Hybrid Schema

A

A hybrid schema occurs when you normalize only part of the design to accommodate shared
dimensions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Star vs Snowflake vs Hybrid

A

• Use the Star schema if you can
• Use the Hybrid schema if you need to share a dimension with multiple dimensions
• Only use the Snowflake schema for quick Proof of Concept projects
o Because the design is closest to your source database that is already in 3NF and will
therefore require less time to construct and will be easier to load data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Granularity

A

the level of detail that data is stored in fact tables /dimensions.
The lower the level of detail, the larger the data amount in the fact table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Dimensionality

A

The dimensional granularity of fact table is dependent on the number of the dimensions connected to
the fact table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly