Econometrics Flashcards
How to formulate a model?
- Statement of theory/hypothesis
- Collect data
- Specify mathematical model and stats theory
- Estimate parameters
- Check for model adequacy
- Test hypothesis
- Use model for predictors
Types of data and examples?
Time series: e.g GDP, unemployment. Can be both quantitative (e.g. prices) and qualitative (gender)
Cross-section: Data on variables from one point in time
Pooled: Combination of both
What is a linear regression?
Regression studying the linear relationship between dependant (explained) variables and independent (explanatory)
What is the population regression function?
Mathematical representation of the line of best fit
What does the error term represent?
- stochastic error (random probability)
- Represents variables not in the model
- Randomness of human behaviour
- Errors of measurement
- Ockham’s razor (keep simple until proved inadequate)
How is a sample regression function different?
Used when you can only estimate values using a sample of the data. Error term is a residual (ei) and Y has a ^ as it is an estimate. Get as close to the PRF as we can
How does OLS work?
Aims to minimise the value of the residual sum of ei^2.
Equation for B1 and B2
Mean of Y - B2(Mean of X) = B1
Sum of XiYi - (n)(mean of x)(mean of y).
ALL DIVIDED BY
Sum of Xi^2 - (n)(Mean of X)^2)
OLS properties?
- SRF passes through the sample means
- Mean of residuals is 0
- Sum of residuals and explanatory variables X is 0 (uncorrelated)
Difference between percentage increase, and percentage point increase?
6%- 7% = 1% percentage point increase
((7-6)/6) x 100 = 16.6% percentage increase
Assumptions of OLS model?
-Linear parameters
-X is uncorrelated with U
-E(u l xi) = 0
-Var (u) = δ^2
-No correlation of error terms (autocorrelation)
cov (ui, uj) = 0 (I and J not equal)
-No specification errors
What is homoscedastic variance? How is it calculated?
All variables have the same variance
Sum of residuals squared (RSS) / n -2 (degrees of freedom)
What is the Gauo- Markov theorem?
OLS estimators are BLUE: Best linear unbiased estimators
What is the central limit theorem?
If there is a large number of independent and identically distributed random variables then the distribution of their sum tends to a normal distribution as the sample reaches infinity
When is the T distribution used?
To test the null that Ho: B2 = 0, to see if there is a relationship between X and Y and the true variance is unknown
T test equation?
(b2 - B2) / se (b2) approx equal to t, n - (n-1)
Why might a one tailed test be used?
If you think the value is definitely above 0 for example
What is the total sum of squares ( sum of y^2) equal to?
ESS + RSS
(b2^2 x sum of (x^2)) + Sum of residuals squared
Proof that TSS = ESS + RSS
Y = (Y^) + e (Y - MeanY) = (Y^ - MeanY) + (Y - Y^)(e) y = b2xi + e and Y^ = b2xi Solve out... Sum of y^2 = b2^2 x sum of x^2 + sum of residuals squared
If the line is a good fit what relationship does ESS and RSS have?
ESS > RSS
How can R^2 be calculated?
ESS/TSS
OR
1 - Sum of residuals / sum of y^2
What is a normality test and some examples?
Used to see if data is close to a normal distribution. Tests include histogram of residuals, probability plot and Jarque- bara test
How is the Jarque-Bara test conducted?
Test skewness and kurtosis for a normal distribution match (small value = normal)
n/6 (skew^2 + (kurt - 3)^2 /4)
What is forecasting?
Using the equation to predict the value outputted (only use numbers within the range to avoid extrapolating)