Business Intelligence & Big Data Flashcards

1
Q

What is Business Intelligence

A

Includes a wide range of applications, practices and technologies for the extraction, transformation, integration, analysis, interpretation and presentation of data to support improved decision making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a Data Warehouse?

A

A database that stores large amounts of historical data in a form that readily supports analysis and management decision making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is ETL?

A

A process used to pull data from different data sources to populate and maintain the data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Explain the Extract step

A

Designed to access the various data sources and pull from each source the data desired to update the data warehouse

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Explain the transform step

A

The data that will be used to update the data warehouse is edited and, if necessary, converted to a different format

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Explain the load step

A

Updates the existing data warehouse with the data that have passed through the extract and transform steps. This creates a new, updated version of the data warehouse.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a Data Mart

A

A smaller version of a data warehouse-scaled down to meet the specific needs of a specific business unit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Big Data?

A

Data collections that are enormous and complex that traditional relational management software, hardware and analysis processes are incapable of dealing with them

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

ACID

A

Atomicity Consistency Isolation Durability
- properties that guarantee database transactions are processed reliably and ensure the integrity of data in the database.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is In-Memory Database?

A

A database management system that stores an entire database in random access memory (RAM)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Business Intelligence Tools

A

Spreadsheets
Dashboards
OLAP (Online Analytical Processing)
Drill-Down Analysis
Data Mining
Reporting and Quering Tools

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is OLAP?

A

A method used to analyze multidimensional data from many different perspectives
- Dimensions: Time, metric, location

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is Drill-Down Analysis?

A

Involves the interactive examination of high-level summary data in increasing detail to gain insight into certain elements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain Data Mining

A

A BI tool used to explore large amounts of data for hidden patterns to predict future trends and behaviors for use in decision making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Process of Data Mining

A
  1. Selection of the data to be used
  2. Preprocessing of the data to remove erroneous, incomplete or inconsistent data
  3. Transformation of the data to create an appropriate set of data to support the data mining technique to be used
  4. The actual data mining process
  5. Evaluation of the results
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Name the Data Mining Techniques

A
  1. Neural Computing
  2. Case-based reasoning
  3. Association analysis
17
Q

Explain Case-Based Reasoning

A

Historical if-then-else cases are used to recognize patterns

18
Q

Explain Association Analysis

A

A specialized set of algorithms sorts through data and forms statistical rules about relationships among items

19
Q

Explain Neural Computing

A

Historical data is examined for patterns that are then used to make predictions

20
Q

What are Dashboards?

A

Dashboards present a set of key performance indicators about the the state of a process at a specific point in time