Big Quiz 4 Flashcards

(21 cards)

1
Q

Data Mining

A

Collecting, aggregating, and visualizing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

artificial intelligence

A

replicate human reasoning and decision making

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

machine learning

A

prediction, and update (predictions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Bi Stack

A

the set of technologies needed for data analysis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Data Sources

A

Own [crm, scm, acc, hrs, erp]
third party DBS
Research & Partners

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Primary vs secondary sources

A

only research is primary, everything else is secondary

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

ETL

A

Extract the data from a data source

Transform the data by cleaning and aggregating it

Load the data into a data warehouse where it can be accessed for future analytics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Data Lake vs Warehouse

A

Lakes are dirty
Warehouses are organized
Notice the warehouse is after the ETL step

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Analytical vs operational

A

warehouse - analytical

source - operational

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

KPI

A

KEY: only the most important measures
Performance: how are we doing as a company?
Indicators: shows us how we are doing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Good Measures

A

Simple
Should be easy to understand

Easy to obtain
Shouldn’t be time/cost/etc. prohibitive to obtain

Precisely Definable
Only one way to interpret it

Objective
Not opinion based

Robust
Not likely to heavily swayed by outside factors

Valid
Are we measuring the right thing the right way?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Types of analytics

A

Descriptive - Understanding the data you DO have (past values)

Predictive - Understanding the data you do NOT have (future)

Prescriptive - ‘What if’ scenarios (compares options)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Clustering

A

Grouping customers or products and creating unique strategies for each segment.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Key influencer analysis

A

Identifying the most influential variables by measuring correlation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Forecasting

A

Prediciting future values over interval time periods based on known values of the same timeframe.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Recommendation Analysis

A

Predicting items that a customer may want to purchase based on the shopping baskets of other customers.

18
Q

Dependant Variable

A

The thing we are trying to predict
Y
Label

19
Q

Independent Variable

A

The inputs to our prediction
X variables
Features

20
Q

Important Terms

A

Null Hypothesis
- Nothing is happening beyond random chance.

Alternative Hypothesis
- Something is happening

P-Value
< 0.05

Effect Size
Measures the amount of impact one variable has on another

21
Q

Important Terms 2

A

Trendline:
A line in a scatterplot that shows the direction a relationship takes

Regression Line:
A line that fits the data the best by minimizing residuals (The distance between the data points and the line)

R-Squared Stat
0 - 1. The closer it is to 1, the more of your data your model explains. Measures how good your model is