Linear Regression Flashcards
(8 cards)
What is supervised learning?
A learning paradigm where a model is trained on input-output pairs (x_i, y_i) to learn a mapping from inputs to outputs.
How is the linear regression model expressed using basis functions?
f(x) = ∑ₖ βₖ φₖ(x), where φₖ are basis functions (e.g., φ₀(x) = 1, φ₁(x) = x).
What loss function does linear regression use and why?
Mean Squared Error: L = (1/n) ∑ᵢ (yᵢ − f(xᵢ))²; penalizes larger errors and is differentiable.
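The loss above can be sketched in a few lines of NumPy; the example values here are illustrative, not from the deck.

```python
import numpy as np

# Mean Squared Error: average of squared residuals (1/n) * sum((y_i - f(x_i))^2)
def mse(y, y_pred):
    return np.mean((y - y_pred) ** 2)

y = np.array([1.0, 2.0, 3.0])
y_pred = np.array([1.5, 2.0, 2.5])
loss = mse(y, y_pred)  # (0.25 + 0.0 + 0.25) / 3
```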
What is the normal equation for the closed-form solution?
β = (Φᵀ Φ)⁻¹ Φᵀ y, where Φ is the design matrix of basis functions.
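A minimal sketch of the normal equation with the basis φ₀(x) = 1, φ₁(x) = x; the data are made up so the exact solution β = (1, 2) is known in advance. Solving the linear system is preferred over forming the explicit inverse for numerical stability.

```python
import numpy as np

# Data generated from y = 2x + 1, so the fit should recover intercept 1, slope 2.
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2 * x + 1

# Design matrix: one row per sample, columns are phi_0(x)=1 and phi_1(x)=x.
Phi = np.vstack((np.ones_like(x), x)).T

# Normal equation: solve (Phi^T Phi) beta = Phi^T y instead of inverting.
beta = np.linalg.solve(Phi.T @ Phi, Phi.T @ y)
```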
How does gradient descent optimize parameters?
β_new = β_old − rate × ∂L/∂β; each step moves in the negative-gradient direction, which decreases the loss for a suitable learning rate.
What is the gradient descent update rule in linear regression?
β_new = β_old − rate × (2/n) Φᵀ (Φ β_old − y); this is a step in the negative-gradient direction of the MSE loss L = (1/n) ∑ᵢ (yᵢ − f(xᵢ))².
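The update rule can be sketched as a plain NumPy loop; the data, learning rate, and iteration count are illustrative assumptions, chosen so the loop converges to the same β the closed form would give.

```python
import numpy as np

# Data generated from y = 2x + 1; gradient descent should recover beta = (1, 2).
x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2 * x + 1
n = len(x)
Phi = np.vstack((np.ones(n), x)).T  # n x 2 design matrix

beta = np.zeros(2)
rate = 0.05  # small enough for convergence on this problem
for _ in range(5000):
    grad = (2 / n) * Phi.T @ (Phi @ beta - y)  # gradient of the MSE loss
    beta = beta - rate * grad                  # step against the gradient
```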
How does learning rate affect gradient descent convergence?
A small learning rate yields slow convergence; a large learning rate can cause oscillation or divergence.
How is the design matrix Φ constructed?
Φ = np.vstack((np.ones(n), x)).T, placing a column of ones (for the intercept) beside the feature column, so each row of Φ is (1, xᵢ).
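A quick sketch of the construction, with a three-point toy input to show the resulting shape:

```python
import numpy as np

# Basis functions phi_0(x) = 1, phi_1(x) = x for three samples.
x = np.array([0.0, 1.0, 2.0])
Phi = np.vstack((np.ones_like(x), x)).T  # shape (3, 2): one row per sample
```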