Statistical Data Mining Flashcards

(7 cards)

1
Q

Loss

A

is a numerical metric that describes how wrong a model’s predictions are. Loss measures the distance between the model’s predictions and the actual labels

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Linear Regression

A

is a statistical technique used to find the relationship between variables. In an ML context, linear regression finds the relationship between features and a label.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do you decide between Mean Absolute Error and Mean Standard Errors?

A

Choosing between MAE and MSE depends on the dataset and how you want the model to handle outliers, with MSE penalizing them more heavily.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The prediction formula for linear regression model

A

Prediction = Bias + (Weight × Feature value)
the weight is the coefficient or multiplier used in the linear model
The feature value is the actual data point for the input variable you’re using.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

y’ = b + w1x1
bias

A

Bias is the same concept as the y-intercept in the algebraic equation for a line. In ML, bias is sometimes referred to as
. Bias is a parameter of the model and is calculated during training.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

y’ = b + w1x1
weight

A

w1 Weight is the same concept as the slope
in the algebraic equation for a line. Weight is a parameter of the model and is calculated during training.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

y’ = b + w1x1
feature

A

x1 is a feature—the input.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly