Categorical regression Flashcards
(10 cards)
Types of categorical response variables
- Ordinal: sequential or not
- Nominal (not ordered)
Multinomial distrbution
MN(m, π)
f(ycx1|π) = π1y1 * π2y2 * … πcyc * (1-Σ πr)m-Σyr
- E(Y)cx1 = mπ
- cov(Y)cxc = m [πi(1-πi) diagonal, -πiπj out]
- Ungrouped data: m=1
- Grouped data: m=ni
(Extended) Multinomial logit
For nominal variable:
πir = h(ηir) = exp[xi‘βr + (wir-wi(c+1))’γ] / [1+Σs=1c exp(xi‘βs + (wis-wi(c+1))’γ)]
πi(c+1) = 1 / [1+Σs=1c exp(xi‘βs)]
ηir = log(πir/πi(c+1))
- Category specific: (wir-wi(c+1)), βir;
- Category unspecific: γ, xi;
- βr is the log(relative risk) of category r compared to the reference category (c+1) and NOT relative to other categories;
- exp(βr) is the effect on the odds;
- If βrj > 0: ODDSr increase relatively to reference category of exp(βrj) with one unit increase of xj. The probability itself is not increasing for increasing xj!
Latent utility formulation for categorical regression
ur = ~ηr+εr
Y=r <=> ur = max of all us
- εi Logistic: P(Y=r) = exp(~ηr - ~η(c+1)) / [1 + Σ exp(~ηs - ~η(c+1))];
- εi Normal(0,1): independent probit;
- εi MVN: multivariate probit.
Cumulative categorical regression model
For ordinal non-sequential variable:
ui = -xi‘β + εi i=1,…G
Yi = r <=> θ(r-1) < ui < θr
P(Yi=r) = F(θr+xi‘β) - F(θ(r-1)+xi‘β)
- NO INTERCEPT for identifiability;
- Fε Logistic: Proportional Odds model;
- Fε Extreme min value: Proportional Hazards model.
Proportional odds model
For ordinal non-sequential variable, ε logistic:
P(Yi <= r) = exp(θr +xi‘β)/[1+exp(θr+xi‘β)]
- OR = exp((xi - ~xi)’β) independent of category.
Proportional hazards / grouped Cox model
For ordinal non-sequential variable, ε extreme min-values:
P(Yi <= r) = 1-exp(-exp(θr + xi‘β))
ML estimation in categorical regression
β^ such that
s(β^) = X’DΣ-1(y-niπ) = 0, with F(β) = X’WX
^β .~ N(β, F-1(^β))
- dii = h’(ηi)
- wii = [h’(ηi)]2/σ2
Xi for ordinal cumulative model
Since we have the c thresholds θ and k regressors (NO intercept):
Xi = cbind( diag(1, c), rep(xi, c) )
Xi for extended multinomial model
We have k+1 regressors β and also γs:
Xi = cbind( diag(xi, c), c(wi1‘-wi(c+1)’, .. wic‘-wi(c+1)’ )