machine learning Flashcards
(113 cards)
What are model performance metrics?
Performance metrics offer various perspectives on the model’s performance, allowing data scientists to choose appropriate evaluation criteria based on project objectives.
What is Ridge Regularization?
Ridge Regularization (L2) modifies overfitted or underfitted models by adding a penalty equal to the sum of the squares of the coefficients.
What does Ridge Regression aim to achieve?
Ridge Regression aims to reduce model complexity while keeping all predictors in the model.
What does the loss function in Ridge Regression consist of?
The loss function consists of the residual sum of squares (RSS) and a penalty term controlled by λ.
What is the effect of a larger λ in Ridge Regression?
A larger λ forces the coefficients to shrink more.
What is Lasso Regularization?
Lasso Regularization (L1) prevents overfitting by adding a penalty term that penalizes the sum of the absolute values of the model’s coefficients.
What is the main benefit of Lasso Regularization?
Lasso Regularization reduces overfitting by promoting sparsity and implicitly performing feature selection.
How does Lasso Regularization affect coefficients?
Lasso can shrink some coefficients to exactly zero, simplifying the model by excluding some features altogether.
What is Elastic Net?
Elastic Net is a hybrid method that combines both Ridge and Lasso penalties.
When is Elastic Net particularly useful?
Elastic Net is useful when there are many correlated predictors.
What are the three main types of regularization techniques?
- Ridge Regularization (L2) * Lasso Regularization (L1) * Elastic Net (Hybrid Model)
What is the purpose of regularization techniques?
Regularization techniques are used to prevent overfitting and improve model performance.
What is the confusion matrix?
A confusion matrix is a table used to describe the performance of a classification model on a set of test data.
What does accuracy measure?
Accuracy measures the proportion of correct predictions among the total number of cases examined.
What is precision in classification metrics?
Precision is the ratio of true positive predictions to the total positive predictions.
What is recall in classification metrics?
Recall is the ratio of true positive predictions to the actual positives in the data.
What is the F1 Score?
The F1 Score is the harmonic mean of precision and recall, providing a balance between the two.
What does high bias indicate?
High bias indicates that the model is unable to learn the patterns in the data, leading to underfitting.
What does high variance indicate?
High variance indicates that the model learns noise from the training data, leading to overfitting.
What is the role of regularization in bias and variance?
Regularization techniques help to balance between bias and variance.
What is logistic regression used for?
Logistic regression is used for predicting a categorical dependent variable using independent variables.
How does logistic regression differ from linear regression?
Logistic regression predicts probabilities for categorical outcomes, while linear regression predicts continuous values.
What is the sigmoid function?
The sigmoid function maps predicted values to probabilities between 0 and 1, forming an S-shaped curve.
What are the three types of logistic regression?
- Binomial * Multinomial * Ordinal