Lecture 6: Logistic Regression (Alt 3) Flashcards
(43 cards)
What type of outcome variable does linear regression model?
A continuous outcome (e.g., liking a person).
What assumption underlies the equation used in linear regression?
A linear relationship that incorporates an intercept and slope coefficient.
What are residuals in the context of linear regression?
Deviations from the predicted line, reflecting model imperfection.
Why is logistic regression introduced as an alternative to linear regression?
Because linear regression fails with dichotomous outcomes.
What kind of outcome variable necessitates the use of logistic regression?
A categorical outcome (e.g., labeling someone as a boyfriend/girlfriend).
What are two key differences between logistic and linear regression?
Predictors are not assumed to be normally distributed, and logistic regression can deal with non-linear relationships among variables.
What function does logistic regression use to model probabilities?
A logistic function that transforms continuous predictions into a 0–1 probability range.
What does it mean for outcome categories in logistic regression to be mutually exclusive?
Each observation fits into only one category.
What does it mean for outcome categories in logistic regression to be exhaustive?
All possible outcome categories are represented.
What happens if outcomes are not mutually exclusive or exhaustive in logistic regression?
The probabilities won’t sum to 1, the model cannot correctly classify or predict the outcome for each case, and the likelihood function will break down.
What is the shape of the curve produced by the logistic function?
S-shaped (sigmoidal).
What does the logistic regression equation do to convert linear predictions into probabilities?
Exponentiates a linear combination of predictors to yield probabilities bounded between 0 and 1.
What do odds express in logistic regression?
The ratio of event likelihood to non-event likelihood.
What mathematical scale is used for logistic regression coefficients?
Log odds (the natural logarithm of odds).
What does each unit increase in a predictor change in terms of log odds?
It changes the log odds by the coefficient B.
What does exponentiating the coefficient B yield in logistic regression?
The odds ratio.
How do odds differ from probabilities in logistic regression?
Odds and probabilities have a non-linear relationship.
Why is the change in probability not constant across the logistic curve?
Because changes in probability are largest near 0.5 and smaller as the predicted probability approaches 0 or 1, reflecting the flattening of the logistic curve at its extremes.
In logistic regression, how is a one-unit increase in the predictor variable reflected in odds?
As a consistent multiplicative change in odds determined by exp(b).
What are the two ways to interpret logistic regression coefficients?
As log odds (the raw b coefficients) and as odds ratios (the exponentiated b, or exp(b)).
What is the null hypothesis (H₀) when interpreting logistic regression coefficients?
That the predictor has no effect on the outcome.
What does b=0 imply in terms of log odds?
There is no change in log odds as the predictor increases.
What does exp(b)=1 imply in terms of odds ratios?
There is no multiplicative change in odds with each unit increase in the predictor.
What test evaluates whether a logistic regression coefficient b is significantly different from 0?
The Wald Test.