Data Mining - Chapter 1 Flashcards

1
Q

What is business analytics (BA)?

A

The practice and art of bringing quantitative data to bear on decision making.

-> It includes a range of data analysis methods

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Wht is Business Intelligence (BI)?

A

the next level of business analytics, which focusses on data visualization and reporting to understand what happened and what is happening.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the purpose of statistcal models such as regression models?

A
  • To describe and quantify on average relationships
  • To predict new records
  • To forecast future values
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is data mining?

A

Business analytics methods that go beyond counts, descriptive techniques, reporting, and methods based on business rules.

  • -> Statistical and mache-learning methods that inform decision-making.
  • -> In general not focused on average predictions, but on specific case predicitons
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What are the four V’s in big data?

A
  1. Volume
  2. Velocity
  3. Variety
  4. Veracity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is meant with volume?

A

The amount of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is meant with velocity?

A

The flow rate of data - the speed at which it is generated and changed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is meant with variety?

A

The different types of data being generated (pictures, text, numbers etc.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is meant with veracity?

A

Data is being generated by organic distributed processes and not subject to controls or quality checks.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is data science?

A

A mix of skills in the areas of statistics, machine learning, math, programming, business and IT.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is an algorithm?

A

A specific procedure used to implement a particular data mining technique.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a model?

A

An algorithm as applied to a dataset, complete with its settings.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a score?

A

A predicted value or class.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly