Econometrics Flashcards
(131 cards)
5 Steps in Econometrics
- Come up with a question
- Build a model
- Get data
- Run your model
- Interpret and refine
Cross Sectional data
data measured at same time, each person different
Time series data
data over time period, look at trends.
temporal dependence (a can provide info leading up to b)
Pooled cross-section data
cross-sectional at multiple points in time, survey in 2020, and again in 2023, but with different people
Panel data
Observations on the same cross-section units at different points in time
i.e Earnings in Victoria (worker 1 in 2022, worker 1 in 2023,worker 2 in 2022, worker 2 in 2023)
Ordinary Least Squares (OLS) meaning
Trying to find the line that best fits your data
* We want to draw a line that's as close as possible to all the dots (data points) * OLS does this by -> minimizing the squared distance between each point and the line
Experimental data
The gold standard for measuring causal effects is using data from randomized control trials
Purpose of the linear regression model
To estimate B1 and B0
Observed and Unobserved elements in linear regression formula
u = errors: a random variable which captures the effect on y of factors other than x (unobserved)
B0 = y intercept (unobserved)
B1 = slope between y and x (unobserved)
The data (x1, y1), (x2, y2), . . . ,(xn, yn) is observed.
What does observed and unobserved mean in linear regression formula
observed = we can already see from the data set, such as the explanatory/dependent variable, e.g. person 1 = 12 years of schooling (ev) and earns $50000 (dv)
unobserved = we can’t necessarily see, unless we work it out/run the regression e.g. b1,b0, u
when to add the subscript “i”
when we have a model with sample data/observations, when talking about actual data from your sample. Each observation (person, firm, country, etc.) gets its own little “i”.
when do we use ^ symbol
Before running regression:
We write:
yi=β0+β1xi+ui
- because we don’t know the betas yet.
After running regression (using OLS):
We get estimates:
β^0,β^1
and use those to make predictions:
y^i=β^0+β^1xi
basically our “predicted linear regression”
Define intercept
When [x variables] are equal to zero, the predicted [y variable] will be equal to[number and unit].
e.g: if the median age in the state was 0 and a woman was from a western state, the average birth rate for a woman in that region would be equal to5.572 births per woman.
what does ^ mean and when do we use it
The hat (^) just means “this is an estimate.” You’re no longer talking about the true value, you’re talking about the value you calculated from your data.
Define slope
Controlling for [other variables], the predicted [y variable], on average [increases/decreases] with [number and unit] for each increase in [one unit] ofthe [x variable].
e.g: Controlling for region, the predicted birth rate in a state on average decreases with 0.128 births per woman for each year increase in the medianage in that state
Define Error term
Controlling for [other variables], the [y variable] that is unexplained by these factors is equivalent to [number and unit]
Define Standard error
The standard error for each value is the standard deviation of the residuals. A smaller residual standard error means the regression model is a better fit.
Matrix form
(m x n) where m = row, n = column
row vector
(1 x n) → denoted by a’
column vector
(m x 1) → denoted by a
Transpose matrix
Interchanging the rows and columns → denoted by A
if transpose is the same as original, then it is SYMMETRIC matrix
Trace in matrix
For square matrices only
→ sum of the elements on the principal diagonal
→ denoted by tr(A)
Identity matrix
Has 1 along the principal diagonal
- Pre/post multiplying by I has no effect
Orthogonal vectors
Where two vectors multiplied equals zero