CH1 Flashcards

(45 cards)

1
Q

What is the course objective of CCDS 221 Data Warehouse?

A

To introduce Data Warehouse concepts and highlight the differences between a conventional Information System (IS) and a Decision Support System (DSS).

The focus is on designing, deploying, and querying a multidimensional DW using appropriate software tools.

2
Q

What are the key topics covered in the course outline?

A
  • Introduction to Data Warehouse (DW)
  • Multidimensional Model
  • Data Mart (DM) Design
  • Multidimensional Implementation
  • ETL Process and Data Cleansing
  • OLAP Algebra
  • More about Facts and Dimensions
  • Introduction to Multidimensional constraints
  • Introduction to Real-Time DW
3
Q

What is the definition of Data Warehousing?

A

The step of the Knowledge Discovery from Data (KDD) process that consists of designing, implementing, and using a DW to support the decision-making process.

A Data Warehouse is a special database built by integrating data from heterogeneous data sources.

4
Q

What is Data Mining?

A

A step in the KDD process that applies data analysis and discovery algorithms to produce patterns or models over the data.

5
Q

What is the primary difference between OLTP and OLAP systems?

A
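  • OLTP (Online Transaction Processing): transactional, supports day-to-day operations
  • OLAP (Online Analytical Processing): decisional, supports analysis and decision-making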
6
Q

What type of data does a Data Warehouse (DW) contain?

A

Subject-oriented, integrated, time-variant, and non-volatile data

This data supports management's decision-making process.

7
Q

What are the characteristics of a Data Warehouse?

A
  • Subject-Oriented
  • Integrated
  • Time-Variant
  • Non-Volatile
8
Q

What does ‘Time-Variant’ mean in the context of a Data Warehouse?

A

Historical data is kept in the DW, allowing retrieval of data from various time periods.

9
Q

What is the purpose of ETL in data warehousing?

A

ETL stands for Extract, Transform, Load; it is used to integrate data into the Data Warehouse.
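
As a rough illustration of the three ETL steps, here is a minimal Python sketch. The two sources, their field names, and the `sales` table are hypothetical and exist only for this example; a real ETL pipeline would read from actual operational systems and do far more cleansing.

```python
import sqlite3
from datetime import datetime

# Two hypothetical sources describing the same kind of sale in different formats.
SOURCE_A = [{"date": "2024-01-15", "amount": "120.50", "branch": "Downtown"}]
SOURCE_B = [{"day": "16/01/2024", "total_cents": 9900, "site": "AIRPORT"}]


def extract():
    """Extract: read raw rows from each source, tagged with their origin."""
    return [("A", row) for row in SOURCE_A] + [("B", row) for row in SOURCE_B]


def transform(tagged_rows):
    """Transform: unify the date format, the amount unit, and the branch spelling."""
    unified = []
    for origin, row in tagged_rows:
        if origin == "A":
            date = datetime.strptime(row["date"], "%Y-%m-%d").date()
            amount = float(row["amount"])
            branch = row["branch"]
        else:
            date = datetime.strptime(row["day"], "%d/%m/%Y").date()
            amount = row["total_cents"] / 100  # cents to currency units
            branch = row["site"]
        unified.append((date.isoformat(), amount, branch.title()))
    return unified


def load(rows, conn):
    """Load: append the cleaned rows to the (hypothetical) warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (sale_date TEXT, amount REAL, branch TEXT)"
    )
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(extract()), conn)
warehouse_rows = conn.execute("SELECT * FROM sales ORDER BY sale_date").fetchall()
print(warehouse_rows)
```

After the load, both rows share one date format, one amount unit, and one spelling convention, whichever source they came from.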

10
Q

What is a Data Mart (DM)?

A

A subset of a Data Warehouse, focused on a specific business area or department.

11
Q

List two examples of where large data volumes are generated.

A
  • Banks
  • Telecommunication companies
12
Q

True or False: OLTP systems are designed to support decision-making.

A

False

OLTP systems focus on operational tasks, not decision-making.

13
Q

What is the role of Knowledge Discovery from Data (KDD)?

A

To extract value from large volumes of data and use it in the decision-making process.

14
Q

Fill in the blank: A _______ is a set of DML commands considered as a whole.

A

Transaction
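
To make "considered as a whole" concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `account` table, the names, and the `transfer` helper are hypothetical; the point is only that the two UPDATE statements either both take effect or neither does.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE account (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.executemany("INSERT INTO account VALUES (?, ?)", [("alice", 100), ("bob", 50)])
conn.commit()


def transfer(conn, source, target, amount):
    """Run two UPDATE statements as a single unit: both apply, or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception is raised
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE name = ?",
                (amount, target),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE name = ?",
                (amount, source),
            )
        return True
    except sqlite3.IntegrityError:
        return False


def balances(conn):
    """Return the current balances as a {name: balance} dictionary."""
    return dict(conn.execute("SELECT name, balance FROM account"))


ok = transfer(conn, "alice", "bob", 30)       # both updates succeed
failed = transfer(conn, "alice", "bob", 500)  # second update violates the CHECK
print(ok, failed, balances(conn))
```

In the failed transfer, the credit to `bob` has already been executed when the debit is rejected; the rollback undoes it, so the balances stay as they were after the first transfer.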

15
Q

What is the definition of a Decision Support System (DSS)?

A

An Information System dedicated to predictive management.

16
Q

What issues arise from the use of traditional IS in decision-making?

A

They often provide too much data but too little useful information for decision-makers.

17
Q

What is a common problem with data in OLTP systems?

A

Data are stored at a very high level of detail, making them difficult for management to use.

18
Q

What is the significance of data integration in a Data Warehouse?

A

It resolves conflicts between different data sources and unifies data formats.

19
Q

What is meant by ‘Non-volatile’ in the context of a Data Warehouse?

A

Once data is loaded into the DW, it is not modified or deleted.

20
Q

What are the main objectives of OLAP systems?

A
  • Support decision-making
  • Allow for ad-hoc queries
  • Aggregate data for analysis
21
Q

What example illustrates the need for aggregated information over detailed data?

A

A manager who needs a summary of travel fees is handed detailed hotel bills and tickets instead.
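
The gap between detailed records and the summary a manager needs can be sketched in a few lines of Python. The receipts, trip names, and the `summarize` helper below are invented for illustration.

```python
from collections import defaultdict

# Hypothetical detailed records, one per receipt (the operational level of detail).
receipts = [
    {"trip": "Conference", "kind": "hotel", "amount": 300.0},
    {"trip": "Conference", "kind": "ticket", "amount": 450.0},
    {"trip": "Client visit", "kind": "hotel", "amount": 120.0},
    {"trip": "Client visit", "kind": "ticket", "amount": 80.0},
]


def summarize(rows, key):
    """Aggregate the amounts of the detailed rows by the chosen key."""
    totals = defaultdict(float)
    for row in rows:
        totals[row[key]] += row["amount"]
    return dict(totals)


per_trip = summarize(receipts, "trip")
print(per_trip)  # {'Conference': 750.0, 'Client visit': 200.0}
```

The manager reads two totals instead of four receipts; on real volumes the same reduction turns thousands of rows into a handful of figures.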

22
Q

What does the Multidimensional Model allow in data warehousing?

A

It enables more user-friendly navigation and querying by decision-makers.

23
Q

What are the five conventional operational features of OLTP?

A
  • Data Input
  • Storage
  • Processing
  • Querying
  • Dissemination
24
Q

What is the purpose of OLAP Algebra?

A

To perform operations on multidimensional data for analysis.
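
The card does not name the operations, so the sketch below assumes two that are commonly found in OLAP algebras, roll-up and slice; the course's own operator set may differ. The cube cells, the city-to-country hierarchy, and the function names are hypothetical.

```python
from collections import defaultdict

# Hypothetical cube cells: (month, city, product) -> sales amount.
cube = {
    ("2024-01", "Paris", "Laptop"): 10,
    ("2024-01", "Lyon", "Laptop"): 4,
    ("2024-02", "Paris", "Phone"): 7,
    ("2024-02", "Berlin", "Laptop"): 5,
}
# Hypothetical hierarchy on the geography dimension: city -> country.
CITY_TO_COUNTRY = {"Paris": "France", "Lyon": "France", "Berlin": "Germany"}


def roll_up_city_to_country(cells):
    """Roll-up: move from city level to country level, summing merged cells."""
    result = defaultdict(int)
    for (month, city, product), value in cells.items():
        result[(month, CITY_TO_COUNTRY[city], product)] += value
    return dict(result)


def slice_by_product(cells, product):
    """Slice: fix one value on the product dimension and drop that dimension."""
    return {(m, g): v for (m, g, p), v in cells.items() if p == product}


by_country = roll_up_city_to_country(cube)
laptops = slice_by_product(by_country, "Laptop")
print(laptops)  # {('2024-01', 'France'): 14, ('2024-02', 'Germany'): 5}
```

Each operation takes a cube and returns a cube, which is what lets them be chained as in an algebra.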

25
What is a significant challenge in analyzing large datasets?
Organizing data from disparate sources in a coherent manner.
26
What type of analyses are expected from a Data Warehouse?
Aggregated and historical analyses to support decision-making.
27
What does the term 'Big Data' refer to?
The massive quantities of data generated daily by various sectors.
28
What is the relationship between Data Warehousing and Business Intelligence?
Data Warehousing supports the analysis and decision-making process in Business Intelligence.
29
What is the main goal of the KDD process?
To extract useful knowledge from large datasets.
30
What is the primary function of a Data Warehouse (DW)?
To support management's decision-making process.

A DW aggregates data to assist in decision-making.
31
What does DSS stand for?
Decision Support System

A DSS is built around a corporate memory.
32
What are the two paradigms of Enterprise Data Warehouse Architecture?
Bill Inmon's paradigm and Ralph Kimball's paradigm

These paradigms represent different data warehousing philosophies.
33
In Bill Inmon's paradigm, how is data stored in the Data Warehouse?
In third normal form (3NF)

3NF is a standard level of database normalization.
34
In Ralph Kimball's paradigm, how is data stored in the Data Warehouse?
In a dimensional model

This approach emphasizes ease of use for analytical purposes.
35
What is the relationship between Data Warehouses and Data Marts in Inmon's paradigm?
A DW is one part of the overall BI system; data marts source their information from the DW.

This indicates a hierarchical structure of data storage.
36
What is the purpose of a Data Mart?
To extract specific data for a group of managers

Data Marts are modeled for analytical tools.
37
What are the two storage spaces in Enterprise DW Architecture?
One central DW and many DMs

DMs refer to Data Marts.
38
What is a key indicator for decision-makers regarding business activities?
Measures determined by correlations/consolidations of datasets

These measures should be independent of operational procedures.
39
Fill in the blank: The way data is perceived should be completely independent of the _______.
data structures and procedures of transactional systems

This independence is crucial for effective decision-making.
40
What is the ETL process in the context of Data Warehousing?
Extract, Transform, Load

ETL is critical for loading the DW/DM from data sources and solving heterogeneity problems.
41
True or False: Relational database (RDB) and OLTP systems are well suited to answering decision-makers' requirements.
False

These systems do not meet the analytical needs of decision-makers.
42
What does the Multidimensional Model highlight in a Data Mart?
The business activity (Fact) to analyze and its analysis axes (Dimensions)

This model is designed to facilitate analysis.
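
One common way to lay out a fact and its dimensions in a relational database is a star schema. The sketch below assumes a hypothetical sales data mart with two dimensions; the table names, column names, and data are invented for illustration and use Python's built-in `sqlite3` module.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
-- Dimensions: the analysis axes.
CREATE TABLE dim_date    (date_id INTEGER PRIMARY KEY, month TEXT);
CREATE TABLE dim_product (product_id INTEGER PRIMARY KEY, category TEXT);
-- Fact: the business activity, with a measure and a key into each dimension.
CREATE TABLE fact_sales (
    date_id    INTEGER REFERENCES dim_date(date_id),
    product_id INTEGER REFERENCES dim_product(product_id),
    amount     REAL
);
INSERT INTO dim_date VALUES (1, '2024-01'), (2, '2024-02');
INSERT INTO dim_product VALUES (1, 'Laptop'), (2, 'Phone');
INSERT INTO fact_sales VALUES (1, 1, 1000), (1, 2, 400), (2, 1, 1200), (2, 1, 800);
""")

# Analyze the fact (sales amount) along two dimensions (month, category).
totals = conn.execute("""
    SELECT d.month, p.category, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_date AS d ON d.date_id = f.date_id
    JOIN dim_product AS p ON p.product_id = f.product_id
    GROUP BY d.month, p.category
    ORDER BY d.month, p.category
""").fetchall()
print(totals)
```

The query reads the way an analyst would phrase the question: total sales amount per month and per product category.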
43
What is a key challenge in querying a Data Warehouse?
Queries are very hard to express despite the simplicity of the query language used

This complexity arises from the nature of the data model.
44
What type of data does a Data Warehouse store?
Big volumes of data from heterogeneous sources

These sources can include organizational databases, partner databases, and the internet.
45
What percentage of a DW project is the ETL process estimated to cost?
Up to 80%

This highlights the significant investment in this phase of a DW project.