# 11 Binary dependent variables Flashcards

1
Q

linear probability model

A

regression model with a binary dependent variable

2
Q

Disadvantages of the linear probability model

A
• Predicted probability can be above 1 or below 0!

- Error terms are heteroskedastic

3
Q

nonlinear probability models

A

Pr(Y = 1) = G(Z)
with Z = β0 + β1X1i + ··· + βkXki
and 0≤G(Z)≤1

Probit: G(Z) = Φ(Z)
Using the cumulative standard normal distribution function Φ(Z )

Logit: G(Z) = 1 / (1 + e^{-Z})
Using the cumulative standard logistic distribution function

Remember:
F(z) = Pr(Z ≤ z)

4
Q

the method used to estimate probit and logit models

A

Maximum Likelihood Estimation (MLE)

The models are nonlinear in the coefficients, so they can’t be estimated by OLS.

5
Q

likelihood function

A

The likelihood function is the joint probability distribution of the data, treated as a function of the unknown coefficients.

6
Q

maximum likelihood estimator (MLE)

A

The maximum likelihood estimator (MLE) are the values of the coefficients that maximize the likelihood function.

MLE’s are the parameter values “most likely” to have produced the data.

7
Q

If Yi is binary, then E(Yi | Xi) =

A

If Yi is binary, then E(Yi | Xi) = Pr(Yi = 1 | Xi)