Logistic Regression Flashcards
what is logistic regression (LR) for?
Logistic regression is an example of a non-linear regression model, which is what we need when we have a dichotomous or categorical DV
the assumptions is LR are characteristically ….
less severe – relatively assumption free
What are the 3 main reasons for performing a logistic regression rather than a standard multiple regression?
1) DV is categorical, and therefore 2) Line of best fit will be sigmoidal, not linear, and as such 3) There will be non-normality and heteroscedasticity in the residuals if OLS regression is used, which violates important assumptions of this method
how does LR build a model?
by measuring the deviance of predictors, and including them or excluding them based on their contribution to predicting the outcome variable …. LR says: Does an individual predictor increase or decrease the probability of an outcome?
as apposed to MR… LR uses a dichotomous DV, and …
continuous IVs -
LR is also not…
linear,
the predictive model is called XX
P hat
the residuals are not…..
Residuals are clearly not normal (skewed)
and exhibit …
heteroscedasticity – residuals are all or nothing, and not evenly distributed.
so, Instead of the model fit being linear (which excludes the possibility of using probability, as a linear line can extend past 0 and 1, where probability lies), LOG_REG uses
a non-linear (sigmoidal) line of best ft.
Probability means =
0-1 or a % 0-100) – likelihood of an event occurring
Odds mean =
Odds = Probability of event divided by its component 1-the probability of an event
why does LR use odds?
unpacks the maths nicely - actually it is converted back t p value after using odds
in LR, instead of using the odds (which are xx), we use the …
asymmetric natural log
what is the natural log called in LR
the logit
what does the odds ratio mean in LR?
Odds ratio: relationship between the odds of an event occurring across levels of another variable (by how much do the odds of Y change as X increases by 1 unit?)
and what does the ‘ratio of ratios’ mean?
Ratio of a ratios – the event of an event occurring as a function of levels of another variable. (e.g. odds of males having a disease with the odds of females having disease – i.e. the odds of these odds combined)
why do we present the results in terms of log odds and odds ratio?
as it turns a non-linear relationship into the familiar linear one
this enables us to subsequently …
test whether this coefficient is significantly different from 0 – just like a t-test in MR
the predicted odds range from ?
0 to + ∞
so when p>.50
odds>1 (.50 would be even at 1)
the predicted odds varies ….
varies exponentially with the predictor(s)
in comparison the natural logit ranges…..
from - ∞ to + ∞
it reflects odds of being a case but
varies linearly with the predictor(s)
