PW1 Flashcards by Tom Cane

What is a quantitative variable

A variable measured in natural units and satisfies cardinality

How well did you know this?

Not at all

Perfectly

What is a continuous variable and give two examples

A variable that can have an infinite number of different values between two points
Time, Age

How well did you know this?

Not at all

Perfectly

What is a count variable and give two examples

A variable that takes specific values indicating a counting of some kind
Number of visits to the hospital, Number of pens

How well did you know this?

Not at all

Perfectly

What is a Categorical variable and give two examples

A variable that expresses some sort of qualitative trait of the objects studied
Eye color: blue, brown, hazel, Smoking status: smoker, non-smoker

How well did you know this?

Not at all

Perfectly

What is a Nominal variable

A nominal variable takes on levels that have no numerical value/interpretation

How well did you know this?

Not at all

Perfectly

What is an Ordinal variable

A variable that has an arbritrary numeric scale where the distance is not possible to establish, so order matters

How well did you know this?

Not at all

Perfectly

Whats the difference between a Binary variable and a Many Category variable

A binary variable only has two levels where as a many category variable will take more than two levels

How well did you know this?

Not at all

Perfectly

Give two examples of a nominal binary and many catagories variable

Binary: Health status & Ethnicity
Many Categories: Type of bycicle, Ethnicity

How well did you know this?

Not at all

Perfectly

Give two examples of an ordinary binary and many categories variable

Binary: Mark(Pass/Fail), Student Status(Under/Postgrad)
Many Categories: Health Status(Poor, Fair, Good), Rating Question(1 to 5)

How well did you know this?

Not at all

Perfectly

What does the slope of a linear regression represent

The slope is the effect on the average y of a unitary change in x

How well did you know this?

Not at all

Perfectly

How does interpretation of the slope change as extra explanatory variables are added to a regression

It doesn’t, each variable is interpreted individually and instead of a line a plane is fitted

How well did you know this?

Not at all

Perfectly

How do Dummy variables work in a regression

The variable will take the value of 1 if a condition is satisfied & 0 if not

How well did you know this?

Not at all

Perfectly

What does the reference level mean for a Dummy variable

The level that takes the value of zero is often called the reference level
This is because this is the level that the other level is compared to

How well did you know this?

Not at all

Perfectly

How is the coefficient for a Dummy variable interpreted for a basic regression model

b1 is the difference in the average y of D(1) compared to D(0)
Shows the difference in Average y when we compare D = 1 to the level D = 0

How well did you know this?

Not at all

Perfectly

How can we derive the effect of a Dummy variable

We can take expectations of the regression model and take the difference of when D = 1 and D = 0
We assume E(X|u) = 0

How well did you know this?

Not at all

Perfectly

How are Categorical variables used in a regression model

Study These Flashcards

In a similar way to dummys, where multiple variables are in the model but are compared to the same reference level

What are interaction effects and why are the important

Study These Flashcards

They capture the effect of two variables working in combination
We can have interactions between continuous variables, Dummys and both combined

How does the interpretation of x change when the model is Log-Level

Study These Flashcards

Log(y) = b0 + b1 * x + u
A unitary change in x implies a (100 * b1) percentage change in y
Known as semi-elasticites

How does the interpretation of x change when the model is Log-Log

Study These Flashcards

Log(y) = b0 + b1 * Log(x) + u
A percentage change in x results in a b1 percentage change in y
Known as common elasticity

How does the interpretation of x change when the model is Level-Log

Study These Flashcards

y = b0 + b1 * log(x) + u
A percentage change in x results in a b1/100 change in y

What is MLR 1

Study These Flashcards

MLR1: Linearity in Parameters
The model can be written as y = b0 + b1 * x + etc

What is MLR 2

Study These Flashcards

MLR2: Random sampling from the population
There is no sample selection, the observations are randomly extracted from the population

What is MLR 3

Study These Flashcards

MLR3: No perfect collinearity in the sample
None of the independent variables in the sample have an exact linear relationship
Independent variables can be correlated, but not perfectly correlated

What is MLR4

Study These Flashcards

MLR4: E(u|x1,…,xk) = E(u) = 0
Under MLR4, we have exogenous explanatory variables

What does it mean for OLS estimators if MLR1-4 hold

- OLS estimators are unbiased

What does it mean for an estimator to be unbiased

- If, on average, it hits the true parameter value - E[βj hat] = βj for j = 1,...,k

What is MLR5

- MLR5: Var(u|x1,...,xk) = Var(u) = σ^2 - The variance in the error term, conditional on x1 - xk is the same for all combinations of outcomes of the explanatory variables - Homoskedasticity - If not held, we have Heteroskedasticity

Why is the size of Var(βj hat) pratically important

- A larger variance means a less precise estimator, which means larger confidence intervals and less accurate hypothesis tests

What are the assumptions MLR1 - MLR5 called

- The Gauss-Markov assumptions

Under the Gauss-Markov assumptions, what is the OLS estimator?

- BLUE - Best: Has the smallesst variance - Linear: Can be expressed as a linear function of the data on the dependent variable - Unbiased: E(βj hat) = βj - Estimator: It is a rule that can be applied to any sample of data to produce and estimate

What do we need to perform statistical inference

- The full sampling distribution of βj - MLR5 tells us nothing about the sampling distribution - We use MLR6 to help

What is MLR6

- MLR6: the population error u is independent of the explanatory variables and is normally distributed with zero mean and variance σ^2 - Assumptions MLR1 - MLR6 are called the classical linear model (CLM) assumptions - Under CLM assumptions βj hat ∼ N(βj , var (βj hat)

PW1 Flashcards

(33 cards)