Statistical Data Mining Flashcards
(7 cards)
Loss
is a numerical metric that describes how wrong a model’s predictions are. Loss measures the distance between the model’s predictions and the actual labels
Linear Regression
is a statistical technique used to find the relationship between variables. In an ML context, linear regression finds the relationship between features and a label.
How do you decide between Mean Absolute Error and Mean Standard Errors?
Choosing between MAE and MSE depends on the dataset and how you want the model to handle outliers, with MSE penalizing them more heavily.
The prediction formula for linear regression model
Prediction = Bias + (Weight × Feature value)
the weight is the coefficient or multiplier used in the linear model
The feature value is the actual data point for the input variable you’re using.
y’ = b + w1x1
bias
Bias is the same concept as the y-intercept in the algebraic equation for a line. In ML, bias is sometimes referred to as
. Bias is a parameter of the model and is calculated during training.
y’ = b + w1x1
weight
w1 Weight is the same concept as the slope
in the algebraic equation for a line. Weight is a parameter of the model and is calculated during training.
y’ = b + w1x1
feature
x1 is a feature—the input.