Chapter 1: Machine Learning for Predictive Analysis Flashcards

Question 1

Q

What is the job of data analytics?

Answer

A

Extracting insights from data

Question 2

Q

What is predictive data analytics?

Answer

A

The art of building and using models that make predictions based on patterns extracted from historical data

Question 3

Q

What are the applications of predictive data analysis?

Answer

A

price prediction (businesses), dosage prediction (doctors), risk assessment (organizations), propensity modeling (predicting the likelihood or propensity of individuals to take different actions), diagnosis (doctors, engineers and scientists), document classification

Question 4

Q

What is a prediction in data analytics? How is it different from the everyday usage?

Answer

A

In DA a prediction is the assignment of a value to any unknown variable. In everyday usage it has a temporal aspect, we predict what will happen in the future

Question 5

Q

What two things are common in all the application examples?

Answer

A

in each case, a model is used to make a prediction to help make a decision AND a model is trained to make predictions based on a set of historical examples (machine learning is used to train these models)

Question 6

Q

What is machine learning?

Answer

A

Machine learning is an automated process that extracts patterns from data.

Question 7

Q

What is supervised machine learning used for?

Answer

A

We use supervised machine learning to build the models used in predictive data analytics applications
- They have labels/ classes/ events that provide us with feedback while learning

Question 8

Q

How do they work?

Answer

A

They automatically learn a model of the relationship between a set of descriptive features and a target feature based on a set of historical examples (instances)

Question 9

Q

What is each row of a dataset called?

Answer

A

training instance

Question 10

Q

What is the overall dataset called?

Answer

A

training dataset

Question 11

Q

When is a model consistent?

Answer

A

when there are no instances in the dataset for which the model does not make a correct prediction

Question 12

Q

What do machine learning algorithms do?

Answer

A

automate the process of learning a model that captures
the relationship between the descriptive features and the target feature in a dataset.

Question 13

Q

Why is searching for consistent models not enough to learn useful prediction models?

Answer

A

When dealing with large databases there will likely be noise
The training set represents only a small sample of the possible set of instances in the domain.

Question 14

Q

What is an ill-posed problem?

Answer

A

An ill-posed problem is a problem for which a
unique solution cannot be determined using only the information that is available

Question 15

Q

Why is machine learning an ill-posed problem?

Answer

A

A single consistent model cannot be found based on the sample training dataset alone

Question 16

Q

What is generalization?

Answer

A

The ability to make predictions for queries that are not present in the data
A prediction model that makes the correct predictions for these
queries captures the underlying relationship between the descriptive and target features
and is said to generalize well

Question 17

Q

What is the goal of machine learning?

Answer

A

Finding the predictive model that generalizes best

Question 18

Q

What is inductive bias?

Answer

A

A set of assumptions that defines the model selection criteria of a machine learning algorithm

Question 19

Q

What are the types of inductive bias?

Answer

A

Restriction bias
-Preference bias

Question 20

Q

What is restriction bias?

Answer

A

It constrains the set of models that the algorithm will
consider during the learning process
Similar to choosing your go-to study method
Tells us what our model is able to represent

Question 21

Q

What is preference bias?

Answer

A

-It guides the learning algorithm to
prefer certain models over others
- Choosing out convergence/satisfaction mechanism
- I like group study but prefer to be the leader than weakest link
- Algorithm’s belief about what makes a good hypothesis

Question 22

Q

What are examples of restriction bias?-

Answer

A

In multivariable linear regression with gradient descent we only consider models that produce description based on a linear combination of the descriptive features
In Iterative Dichotomizer 3 we only consider tree-like prediction models where each branch encodes a sequence of checks on individual descriptive features

Question 23

Q

What are examples of preference bias?

Answer

A

In MLR with GD we linearly combine the descriptive features using only weights that were found though our gradient descent approach
In ID3 we are preferring shallower (less complex) trees over larger/deeper trees

Question 24

Q

Why is inductive bias necessary for learning beyond the dataset?

Answer

A

Without it we could only perform memorization of our training dataset without generalization capacity

Question 25

Q

What is model induction?

Answer

A

The creation of models from data

Question 26

Q

What is the difference between classification problem and regression problem?

Answer

A

Classification problem has the target as a category
Regression problem has the target as a number

Question 27

Q

What is another name for dataset?

Answer

A

-One whose form is the same is a table/relation of a database
- Worksheet of a spreadsheet
- Array in math

Question 28

Q

What is an instance

Answer

A

Row, tuple, record of a database table
Case in statistics
Object of a class in programming
Datapoint or vector in math

Question 29

Q

What is an independent variable?

Answer

A

It is the attribute supplied as input
Also known as explanatory variable, inputs, predictors
-Features are the table’s columns

Question 30

Q

What is dependent variable

Answer

A

The target variable whose values are to be predicted.
Aka class or label or output

Question 31

Q

What are some confusing facts about independent and dependent variables?

Answer

A

Independent variables may not be independent on each other or anything else
Dependent variables does not always depend of all the independent variables

Question 32

Q

Facts about the target variable

Answer

A

Sometimes it is considered to be included in the set of features, sometimes it is not
The target variable is not used to predict itself
Prior values may be helpful to predict future values and may be included as input features

Question 33

Q

What is the process of building a model (or training your classifier) from historical data?

Answer

A

Induction, learning, training or generalization

Question 34

Q

When does the real value of machine learning become apparent?

Answer

A

When we want to build prediction models from large datasets with multiple features

Question 35

Q

How do you know the number of possible prediction models?

Answer

A

There are three descriptive features so there are 2^3 possible combinations of descriptive feature values
For each descriptive feature there are 3 possible target feature values
There are 3^8 = 6,561 possible prediction models

Question 36

Q

What is the ability to memorize a training dataset?

Answer

A

Consistency

Question 37

Q

What does Occam’s Razor say about simplicity?

Answer

A

With all things being equal the simplest explanation tends to be the right one (upper bound)

Question 38

Q

What does albert Einstein say about simplicity

Answer

A

Everything should be made as simple as possible but not simpler (lower bound)

Question 39

Q

What are the sources of information that guide machine learning algorithms?

Answer

A

Training data
Inductive bias of the algorithm

Question 40

Q

What can go wrong with machine learning?

Answer

A

Inappropriate inductive bias which leads to mistakes

Question 41

Q

What does no free lunch mean?

Answer

A

if an algorithm does well on a certain class of problems then it necessarily pays for that with degraded performance on the set of all remaining problems

Question 42

Q

What happens if we choose the wrong inductive bias?

Answer

A

Underfitting (the prediction model is oversimplifies)
-Overfitting (the prediction model is so complex that it becomes too sensitive to noise in the data, it memorizes)

Question 43

Q

What is a Goldilocks model?

Answer

A

A model that is just right and strikes a good balance between overfitting and underfitting
it is found by using algorithms with appropriate inductive biases

Question 44

Q

What is CRISP-DM

Answer

A

Cross Industry Standard Process for Data Mining is a data mining process model that describes commonly used approaches that data mining experts use to tackle problems

Question 45

Q

What are the phases?

Answer

A

-Business Understanding- defining customers’ needs, understanding project objectives
-Data Understanding- collection and data familiarity
-Data Preparation- construct final dataset from raw data
-Modeling- select machine learning techniques relevant to the problems and their parameters are calibrated to optimal values
-Evaluation- outcome collection, compare obtained model with business objectives
-Deployment- put into production, organize and present knowledge gain in a way that the customer can use it

Question 46

Q

List other data life cycle models

Answer

A

-Semma- Sample, Explore, Modify, Model, Assess
- Data Mining and Knowledge Discovery from Data (KDD)- mostly used in the real world

Question 47

Q

What is supervised machine learning based on?

Answer

A

The assumption that data does not change over time. They create models that distinguish between classes present in the dataset they are induced from

Brainscape's Knowledge GenomeTM

Chapter 1: Machine Learning for Predictive Analysis Flashcards

Brainscape's Knowledge Genome^TM