PW1 Flashcards

(33 cards)

1
Q

What is a quantitative variable

A
  • A variable measured in natural units and satisfies cardinality
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a continuous variable and give two examples

A
  • A variable that can have an infinite number of different values between two points
  • Time, Age
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a count variable and give two examples

A
  • A variable that takes specific values indicating a counting of some kind
  • Number of visits to the hospital, Number of pens
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Categorical variable and give two examples

A
  • A variable that expresses some sort of qualitative trait of the objects studied
  • Eye color: blue, brown, hazel, Smoking status: smoker, non-smoker
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a Nominal variable

A
  • A nominal variable takes on levels that have no numerical value/interpretation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is an Ordinal variable

A
  • A variable that has an arbritrary numeric scale where the distance is not possible to establish, so order matters
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Whats the difference between a Binary variable and a Many Category variable

A
  • A binary variable only has two levels where as a many category variable will take more than two levels
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give two examples of a nominal binary and many catagories variable

A
  • Binary: Health status & Ethnicity
  • Many Categories: Type of bycicle, Ethnicity
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Give two examples of an ordinary binary and many categories variable

A
  • Binary: Mark(Pass/Fail), Student Status(Under/Postgrad)
  • Many Categories: Health Status(Poor, Fair, Good), Rating Question(1 to 5)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What does the slope of a linear regression represent

A
  • The slope is the effect on the average y of a unitary change in x
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How does interpretation of the slope change as extra explanatory variables are added to a regression

A
  • It doesn’t, each variable is interpreted individually and instead of a line a plane is fitted
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How do Dummy variables work in a regression

A
  • The variable will take the value of 1 if a condition is satisfied & 0 if not
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does the reference level mean for a Dummy variable

A
  • The level that takes the value of zero is often called the reference level
  • This is because this is the level that the other level is compared to
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

How is the coefficient for a Dummy variable interpreted for a basic regression model

A
  • b1 is the difference in the average y of D(1) compared to D(0)
  • Shows the difference in Average y when we compare D = 1 to the level D = 0
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How can we derive the effect of a Dummy variable

A
  • We can take expectations of the regression model and take the difference of when D = 1 and D = 0
  • We assume E(X|u) = 0
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

How are Categorical variables used in a regression model

A
  • In a similar way to dummys, where multiple variables are in the model but are compared to the same reference level
17
Q

What are interaction effects and why are the important

A
  • They capture the effect of two variables working in combination
  • We can have interactions between continuous variables, Dummys and both combined
18
Q

How does the interpretation of x change when the model is Log-Level

A
  • Log(y) = b0 + b1 * x + u
  • A unitary change in x implies a (100 * b1) percentage change in y
  • Known as semi-elasticites
19
Q

How does the interpretation of x change when the model is Log-Log

A
  • Log(y) = b0 + b1 * Log(x) + u
  • A percentage change in x results in a b1 percentage change in y
  • Known as common elasticity
20
Q

How does the interpretation of x change when the model is Level-Log

A
  • y = b0 + b1 * log(x) + u
  • A percentage change in x results in a b1/100 change in y
21
Q

What is MLR 1

A
  • MLR1: Linearity in Parameters
  • The model can be written as y = b0 + b1 * x + etc
22
Q

What is MLR 2

A
  • MLR2: Random sampling from the population
  • There is no sample selection, the observations are randomly extracted from the population
23
Q

What is MLR 3

A
  • MLR3: No perfect collinearity in the sample
  • None of the independent variables in the sample have an exact linear relationship
  • Independent variables can be correlated, but not perfectly correlated
24
Q

What is MLR4

A
  • MLR4: E(u|x1,…,xk) = E(u) = 0
  • Under MLR4, we have exogenous explanatory variables
25
What does it mean for OLS estimators if MLR1-4 hold
- OLS estimators are unbiased
26
What does it mean for an estimator to be unbiased
- If, on average, it hits the true parameter value - E[βj hat] = βj for j = 1,...,k
27
What is MLR5
- MLR5: Var(u|x1,...,xk) = Var(u) = σ^2 - The variance in the error term, conditional on x1 - xk is the same for all combinations of outcomes of the explanatory variables - Homoskedasticity - If not held, we have Heteroskedasticity
28
Why is the size of Var(βj hat) pratically important
- A larger variance means a less precise estimator, which means larger confidence intervals and less accurate hypothesis tests
29
What are the assumptions MLR1 - MLR5 called
- The Gauss-Markov assumptions
30
Under the Gauss-Markov assumptions, what is the OLS estimator?
- BLUE - Best: Has the smallesst variance - Linear: Can be expressed as a linear function of the data on the dependent variable - Unbiased: E(βj hat) = βj - Estimator: It is a rule that can be applied to any sample of data to produce and estimate
31
What do we need to perform statistical inference
- The full sampling distribution of βj - MLR5 tells us nothing about the sampling distribution - We use MLR6 to help
32
What is MLR6
- MLR6: the population error u is independent of the explanatory variables and is normally distributed with zero mean and variance σ^2 - Assumptions MLR1 - MLR6 are called the classical linear model (CLM) assumptions - Under CLM assumptions βj hat ∼ N(βj , var (βj hat)
33