Ordinal Logistic Regression Flashcards

Question

How do you obtain coefficient estimates in OLR in a proportional odds model?

Answer 1

gologit2 , pl Log odds would be the same in all part of the table Intercepts for each dichotomisation vary 'pl' stands for parallel lines

Answer 2

Just like in BLR, we can use the estimated coefficients to calculate predicted probabilities for sample members with any combination of covariate values In OLR (proportional and non-proportional odds models) we can calculate the probability of being in any one of the outcome categories

Answer 3

Two different dichotomisations, with different slope coefficients, probabilities and intercepts in each equation logit(p1) = c1 + β11x1 + β21x2 + ... + βk1xk logit(p2) = c2 + β12x1 + β22x2 + ... + βk2x2

Answer 4

'pl' option

Answer 5

Null model: logit[P(Lifesat > j)] = cj j = {low, medium} We just have the intercept cj for dichotomisation j

Answer 6

We're not usually interested in the null model for its own sake, but serves as a comparison point for other models

Answer 7

Table 1: Baseline log odds of being in a higher than 'low' category Table 2: Baseline log odds of being in a higher than 'medium' category

Answer 8

The log odds of the observed proportions in the dataset. E.g., with life satisfaction (J = 3, taking the intercept for 'higher than low': P(Lifesat > "Low") = exp(β0 for low) / 1 + exp(β0 for low) The result of the reverse logit transformation is the proportion of the sample reporting higher than low life satisfaction

Answer 9

Graphically illustrate the relationship between the continuous predictor and ordinal outcome. This can be done by, for example, plotting predicted probabilities, with curvatures indicating non-linearity. The continuous predictor e.g., age with a mean of 50 should also be centred (normally around the mean) so the intercepts can be interpreted as odds for those aged 50

Answer 10

One interpretation per independent variable If there are other covariates in the model, we would also be controlling for other independent variables

Answer 11

To ensure missing data do not affect number of observations

Answer 12

Probability associated with LRT comparing model computed with the null model

Answer 13

H0: None of the independent variables in the current model predicts the DV H1: At least one of the independent variables predicts the DV Under H0, the LRT follows a chi-squared distribution with df equal to the number of independent variables in the model

Answer 14

LRT = -2 x (LLnull - LLcurrent model)

Answer 15

LRTs in OLR are analogous to BLR By default, Stata displays the LRT comparing the log likelihood (LL) of the estimated model with the LL of the null model

Answer 16

From the log-likelihoods of the null model and the current model

Answer 17

- The DV is ordinal - For numeric/continuous predictors, the relationship between the IV and the log odds of the outcome is linear - Proportional odds: The coefficient for every IV is assumed to be the same for any dichotomisation of the DV. This is sometimes called the "parallel regression" assumption. The proportional odds assumption can be relaxed in a non-proportional odds model

Answer 18

- Dichotomising the DV in all possible ways - Fitting a BLR on each dichotomisation - Comparing the estimated coefficients from each of these BLRs If the ORs are different in each dichotomisation, there may be evidence that the odds are not proportional.

Answer 19

BLR for "Y > 0" (lifesat > 'low') BLR for "Y > 1" (lifesat > 'moderate') BLR coefficients for each dichotomisation This is the test for each individual IV

Answer 20

In the first row of the output, Stata displays an omnibus test. This test the H0 that the odds are proportional for all IVs. A good strategy is to first look at the omnibus test ("All"). If the result is not significant, we may assume proportional odds. If the result is statistically significant, we look at the individual tests to find out which variable may be problematic

Answer 21

- The coefficients are reasonably similar in the BLRs (indicating that the population ORs may be equal) - The Brant test statistics all have large p-values

Answer 22

- With very small samples, the test may lack power and fail to detect important departures from the proportional odds assumption - With very large samples, the test may be overly sensitive and detect unimportant departures from the proportional odds assumption Therefore, you should always inspect the estimated coefficients in the top part of the output as well as looking at the Brant test p-values themselves. Use your judgement in deciding whether the assumption is reasonable

Answer 23

Adding a quadratic term. This would have its own coefficient in the model (must be included with the centred variable)

Answer 24

Make the interpretation and statistical analysis easier by reducing the chance of multicollinearity E.g., if there was an age variable of 20-70, which was then squared, the age and age-squared variable may be highly correlated. By shifting age down to the mean (50), there will be negative and positive numbers, making it appear more normally distributed and less likely to be highly correlated with the squared term

Answer 25

We can use the LRT - one model with the quadratic term and one without H0: Model 2 does not add predictive power compared to Model 1 H1: Model 2 predicts the DV better than Model 1 (including a squared term for the IV, alongside the linear effect, improves the prediction)

Answer 26

The relationships are allowed to be curved

Answer 27

To indicate that the coefficients are free to differ between equations

Answer 28

- Inefficient, since proportional odds can safely be assumed for some variables - More complicated to interpret than a proportional odds model, since there is a larger number of parameters (coefficients, ORs)

Answer 29

logit[P(Lifesat > J)] = cj + β1 x Female + β2 x Agecentred + β3 x Agecentred_continuous + β4j x Friends - The coefficients for β1, β2, and β3 are the same across equations (proportional odds assumed for female, age, and age2) - The subscript j in β4j indicates that the slope coefficient of friends is free to vary across equations (proportional odds is not assumed for friends)

Answer 30

gologit2 , or pl est store propodds gologit2 , or pl() est store partial gologit2 , or est store noprop lr test partial propodds lr test noprop partial

Answer 31

Can test partial proportional to proportional odds model. Partial proportional odds model is nested within proportional odds model. Test non-proportional odds model against partial proportional odds model

Answer 32

Stata will give a table with p-values for proportional odds, partial-proportional odds, and the non proportional odds models, comparing the latter two to the first. - There is evidence that the partial proportional odds model fits the data better than the full proportional odds model (p = 0.038). The partial proportional odds model is best supported by the data. - There is no strong evidence that the non-proportional odds model improves the fit compared to the partial proportional odds model (p = 0.503).

Answer 33

- Model fit: the model should predict the outcome reasonably well; measured by the log likelihood. - Parsimony: a smaller, simpler model is preferred to a larger, more complicated model; measured by the number of parameters (fewer parameters = simple model) We can use LRTs to compare models. In general, it's advised to choose the simplest plausible model that fits the data reasonably well

Answer 34

- Assuming proportional odds leads to a simpler model than non-proportional odds - Assuming linearity (in the log odds) is simpler than using non-linear terms (e.g., age2) But we should allow for non-proportional odds (e.g., via a partial proportional odds model) and/or non-linearity if we think that this improves the model fit

Ordinal Logistic Regression Flashcards

(61 cards)