L2 Ridge Flashcards

1
Q

Unnamed: 0

A

Unnamed: 1

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

L2 Ridge regularization

A

L2 regularization, also known as Ridge regularization, is a method used to prevent overfitting in machine learning models by adding a penalty term to the loss function. In summary, L2 Ridge regularization is a valuable technique to control overfitting and handle multicollinearity in machine learning models. However, it requires careful tuning of the regularization strength hyperparameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q
  1. Definition
A

L2 regularization is a type of regularization that adds a penalty term to the loss function, which is equivalent to the square of the magnitude of the coefficients.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q
  1. Mathematical Formulation
A

In L2 regularization, the penalty added to the loss function is the square of the magnitude of the coefficients, scaled by a hyperparameter usually denoted by lambda. If L(f) is the unregularized loss, the regularized loss L’(f) is then given by L’(f) = L(f) + λ||w||^2_2, where w are the model parameters.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q
  1. Shrinkage of Coefficients
A

Unlike L1 regularization, L2 regularization does not result in a sparse model with zero coefficients. Instead, it tends to distribute the weights evenly across all features, thereby shrinking the coefficients but not making them zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
  1. Advantages
A

L2 regularization helps to prevent overfitting by reducing the complexity of the model, which improves model generalization. It also works well with correlated features, distributing weights across them, unlike L1 regularization.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q
  1. Limitations
A

While L2 regularization helps with model generalization, it does not perform feature selection like L1 regularization, as it doesn’t reduce coefficients to zero. Also, choosing the right value for the regularization strength parameter (lambda) can be challenging and may require techniques like cross-validation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q
  1. Usage
A

L2 regularization is used in linear regression (Ridge regression), logistic regression, support vector machines (SVM), and neural networks among other machine learning models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
  1. Parameter Tuning
A

The strength of the L2 regularization is controlled by a hyperparameter, usually denoted by lambda or alpha. This hyperparameter needs to be carefully tuned to find the right level of regularization. Too high a value can cause underfitting, while too low a value might not effectively control overfitting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly