Econometrics Flashcards

(131 cards)

1
Q

5 Steps in Econometrics

A
  1. Come up with a question
  2. Build a model
  3. Get data
  4. Run your model
  5. Interpret and refine
2
Q

Cross Sectional data

A

data on different units (people, firms, countries) all measured at the same point in time; each observation is a different unit

3
Q

Time series data

A

data on one unit observed over a period of time; used to look at trends.

Exhibits temporal dependence (earlier observations can carry information about later ones)

4
Q

Pooled cross-section data

A

cross-sectional samples taken at multiple points in time, e.g. a survey run in 2020 and again in 2023, but with different people each time

5
Q

Panel data

A

Observations on the same cross-section units at different points in time

e.g. earnings in Victoria (worker 1 in 2022, worker 1 in 2023, worker 2 in 2022, worker 2 in 2023)

6
Q

Ordinary Least Squares (OLS) meaning

A

Finding the line that best fits your data

* We want to draw a line that is as close as possible to all the dots (data points)

* OLS does this by minimizing the sum of squared distances between each point and the line
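A minimal sketch of this idea (made-up data, numpy only, not from the deck): the closed-form OLS line attains a smaller SSR than any other candidate line.

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 8.1, 9.8])

def ssr(b0, b1):
    """Sum of squared vertical distances from each point to the line b0 + b1*x."""
    return np.sum((y - (b0 + b1 * x)) ** 2)

# Closed-form OLS: slope = sample Cov(x, y) / Var(x), intercept = ybar - slope * xbar
b1_hat = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0_hat = y.mean() - b1_hat * x.mean()

print(ssr(b0_hat, b1_hat))        # SSR at the OLS line (the minimum)
print(ssr(b0_hat + 0.5, b1_hat))  # any other line gives a larger SSR
```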
7
Q

Experimental data

A

Data from randomized controlled trials (RCTs); the gold standard for measuring causal effects

8
Q

Purpose of the linear regression model

A

To estimate β₀ and β₁

9
Q

Observed and Unobserved elements in linear regression formula

A

u = error: a random variable which captures the effect on y of factors other than x (unobserved)

β₀ = y-intercept (unobserved)

β₁ = slope between y and x (unobserved)

The data (x1, y1), (x2, y2), . . . ,(xn, yn) is observed.

10
Q

What does observed and unobserved mean in linear regression formula

A

observed = what we can already see in the data set, such as the explanatory/dependent variables, e.g. person 1 has 12 years of schooling (explanatory) and earns $50,000 (dependent)

unobserved = what we can't necessarily see, unless we work it out / run the regression, e.g. β₁, β₀, u

11
Q

when to add the subscript “i”

A

when we have a model with sample data/observations, when talking about actual data from your sample. Each observation (person, firm, country, etc.) gets its own little “i”.

12
Q

when do we use ^ symbol

A

Before running regression:
We write:

yᵢ = β₀ + β₁xᵢ + uᵢ

- because we don't know the betas yet.

After running regression (using OLS):
We get estimates:

β̂₀, β̂₁

and use those to make predictions:

ŷᵢ = β̂₀ + β̂₁xᵢ

basically our "predicted linear regression"

13
Q

Define intercept

A

When [x variables] are equal to zero, the predicted [y variable] will be equal to [number and unit].

e.g.: if the median age in the state was 0 and a woman was from a western state, the average birth rate for a woman in that region would be equal to 5.572 births per woman.

14
Q

what does ^ mean and when do we use it

A

The hat (^) just means “this is an estimate.” You’re no longer talking about the true value, you’re talking about the value you calculated from your data.

15
Q

Define slope

A

Controlling for [other variables], the predicted [y variable] on average [increases/decreases] by [number and unit] for each [one-unit] increase in the [x variable].

e.g.: Controlling for region, the predicted birth rate in a state on average decreases by 0.128 births per woman for each one-year increase in the median age in that state.

16
Q

Define Error term

A

Controlling for [other variables], the [y variable] that is unexplained by these factors is equivalent to [number and unit]

17
Q

Define Standard error

A

The standard error of the regression is the standard deviation of the residuals. A smaller residual standard error means the regression model is a better fit.

18
Q

Matrix form

A

(m x n), where m = number of rows and n = number of columns

19
Q

row vector

A

(1 x n) → denoted by a’

20
Q

column vector

A

(m x 1) → denoted by a

21
Q

Transpose matrix

A

Interchanging the rows and columns → denoted by A′

if the transpose is the same as the original, then it is a SYMMETRIC matrix

22
Q

Trace in matrix

A

For square matrices only
→ sum of the elements on the principal diagonal
→ denoted by tr(A)

23
Q

Identity matrix

A

Has 1s along the principal diagonal and 0s elsewhere

  • Pre- or post-multiplying by I has no effect
24
Q

Orthogonal vectors

A

Two vectors whose scalar (dot) product is zero: a′b = 0

25
Norm vector
||a|| = (a′a)^(1/2). The norm of the vector a is the square root of the scalar product of a with itself.
26
Inverse matrix
A is nonsingular (has an inverse) if and only if |A| ≠ 0. Then A·A⁻¹ = A⁻¹·A = Iₙ.
27
Determinant
|A| = ad − bc (for a 2×2 matrix). Defined for square matrices only. |AB| = |A||B| = |B||A| = |BA|.
28
Random variable
a variable whose value is an outcome of a random experiment (e.g. flipping a coin); can be discrete or continuous
29
Probabilities corresponding to each value:
p₁, p₂, . . . , pₘ where p₁ = P(X = x₁), p₂ = P(X = x₂), and so on.
30
Expected value
is a weighted average of all possible values of X, with weights determined by the probability density function
31
Graph used for looking at random variable
Probability Density Function (PDF)
32
Write out in probability form: The Probability of that the height of a randomly selected man lies in a certain interval (between 180-190) is the area under the pdf over that interval
P(180 < X < 190) = the area under the pdf between 180 and 190
33
Expected value formula
E(X) = p1x1 + p2x2 + .... + pmxm
34
Expected value constant c rule
For any constant c, E(c) = c
35
Expected value constant a and b rule
For any constants a and b, E(aX + b) = aE(X) + b
36
The expected value of the sum of several variables IS the sum of their expected values formula
E(X + Y + Z) = E(X) + E(Y) + E(Z)
37
E(X + Y + Z) = E(X) + E(Y ) + E(Z) how constants a,b,c,d are integrated into it:
E(a + bX + cY + dZ) = a + bE(X) + cE(Y ) + dE(Z)
38
Does E go through non-linear transformations
No. "Go through" asks whether you can pull constants or functions out of the expectation, i.e. rearrange the equation and get the same result. Non-linear transformations include squares, logs, and products, e.g. E(XY) ≠ E(X)E(Y) in general. But when X and Y are independent (X can't tell you anything about Y), E(XY) = E(X)E(Y) does hold.
39
Example of X and Y being independent (coin toss) in Expected Value
Let's define two random variables: X = 1 if the first coin is heads, X = 0 if tails; Y = 1 if the second coin is heads, Y = 0 if tails. Each coin is fair and the two tosses are independent, so E(XY) = P(both heads) = P(X = 1 and Y = 1) = 0.25 = (0.5)(0.5) = E(X)E(Y).
40
Median of X probability formula
P(X ≤ x_med) = 0.5
41
Q: X is a discrete random variable with possible values {−3, −1, 0, 1, 5}. The expected value of X is?
We are not told that the outcomes are equally likely, so (c): we cannot compute E(X) because we do not know the probability of each outcome.
42
Define Variance
the expected squared distance from X to its mean, measuring how spread out the data is
43
Another way of writing E(X)
μₓ (mu)
44
Variance formula
σ²ₓ = Var(X) = E[(X − μₓ)²]

E.g. for a fair die with six values (1 to 6, each with probability 1/6) and E(X) = 3.5:

Var(X) = [(1 − 3.5)² + (2 − 3.5)² + (3 − 3.5)² + (4 − 3.5)² + (5 − 3.5)² + (6 − 3.5)²] / 6 ≈ 2.92

so X tends to deviate from its mean by about 2.92 squared units (SD ≈ 1.71)
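A quick numeric check of the die example (my own verification sketch, not from the deck):

```python
import numpy as np

values = np.arange(1, 7)       # die faces 1..6
probs = np.full(6, 1 / 6)      # fair die: each face has probability 1/6

mu = np.sum(probs * values)                # E(X) = 3.5
var = np.sum(probs * (values - mu) ** 2)   # Var(X) = E[(X - mu)^2] ≈ 2.92
sd = np.sqrt(var)                          # SD(X) ≈ 1.71
print(mu, var, sd)
```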
45
Variance (Var(X) can also be denoted as
σ²ₓ
46
Standard deviation formula
σₓ = SD(X) = √E[(X − μₓ)²], thus σₓ = SD(X) = √Var(X)
47
Q: You have $1 to bet. You can either bet it all at once or play 20 cents for 5 rounds. Compare the risk and return for each strategy (not averaging returns/var)
We need to calculate two things here: the return (expected value) and the risk (variance).

For the $1 bet: E(5X) = 5E(X) → return increases 5×. Var(5X) = 25Var(X) → risk increases 25×. Why 25? Because Var(aX) = a²Var(X).

For the five 20-cent bets: E(X₁ + X₂ + X₃ + X₄ + X₅) = 5E(X) → return increases 5×. Var(X₁ + X₂ + X₃ + X₄ + X₅) = 5Var(X) → risk increases only 5× (independent rounds).

Note: here we are not averaging returns/variances. Conclusion: 5 × 20 cents is the better option, since it gives the same expected return with much lower risk.
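A small simulation of the two strategies, with a made-up payoff distribution (each 20-cent stake pays $0.40 or $0 with equal probability; purely illustrative):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200_000
X = rng.choice([0.0, 0.4], size=(n, 5))  # payoff of a 20-cent bet per round

all_at_once = 5 * X[:, 0]   # $1 bet: one realization scaled by 5
spread_out = X.sum(axis=1)  # five independent 20-cent bets

print(all_at_once.mean(), spread_out.mean())  # same expected return
print(all_at_once.var(), spread_out.var())    # variance ratio ≈ 25 : 5
```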
48
We can take a sample of one observation from the random variable and use that as the estimate of the mean, or we can take a sample of 5 observations and take the average of those 5 observations as the estimate of the mean (are averaging returns/var)
1 observation: E(X) = μ, Var(X) = σ².

5 observations: E(X̄) = E[(1/5)(X₁ + X₂ + X₃ + X₄ + X₅)] = μ. Var(X̄) = Var[(1/5)(X₁ + X₂ + X₃ + X₄ + X₅)] = (1/25)[Var(X₁) + Var(X₂) + ...] = (1/25)(5σ²) = σ²/5.

Note: here we ARE averaging returns/variances.
49
Define Covariance
measures how two variables change together (directly or inversely proportional); its magnitude can be misleading if X and Y differ in units (100s vs 1000s)
50
Covariance formula
Cov(X, Y) = E[(X − μₓ)(Y − μᵧ)] = E(XY) − μₓμᵧ

You're looking at how far X is from its mean and how far Y is from its mean, then taking the average of their product.
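A tiny sketch (made-up data) checking that the two forms of the formula agree:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 1.0, 4.0, 3.0])

mx, my = x.mean(), y.mean()
cov_a = np.mean((x - mx) * (y - my))  # E[(X - mu_x)(Y - mu_y)]
cov_b = np.mean(x * y) - mx * my      # E(XY) - mu_x * mu_y
print(cov_a, cov_b)                   # identical (population / ddof=0 form)
```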
51
What if Covariance is > 0
If Cov(X, Y) > 0, then on average: X > E[X] ⇒ Y > E[Y], and X < E[X] ⇒ Y < E[Y].

When X is above average, Y tends to be above average too; when X is below average, Y tends to be below average → they "move together".
52
What if Covariance is < 0
If Cov(X, Y) < 0, then on average: X > E[X] ⇒ Y < E[Y], and X < E[X] ⇒ Y > E[Y].

When X is above average, Y tends to be below average; when X is below average, Y tends to be above average → they "move oppositely".
53
What if Covariance is = 0
Could mean they're independent, or just that they have no linear relationship; other types of dependence might still exist.
54
X and Y with independence and covariance
If X and Y are independent, then Cov(X, Y) = 0. But the converse fails: Cov(X, Y) = 0 does not imply independence, since dependent variables can still have zero covariance.
55
Covariance and a and b constant
Where a and b are constants → Cov(aX, bY) = abCov(X, Y). Unlike expected value, the scaling constants come out as a product (ab), and covariance handles cross-products.
56
Variance constant rule | c, Var(c)
For any constant c, Var(c) = 0 (a constant does not vary).
57
Q: X and Y are statistically independent random variables. If Var(X) = 4 and Var(Y ) = 9 what is Var(2X − Y)?
Var(2X − Y) = 4Var(X) + Var(Y) = 4 × 4 + 9 = 25. (The covariance term vanishes because X and Y are independent.)
58
Variance constant a and b rule | Var(aX + b)
For any constants a and b, Var(aX + b) = a²Var(X). Adding a constant does not change the variance.
59
Variance constant a and b rule | Var(aX + bY) or Var(aX - bY)
Var(aX + bY) = a²Var(X) + 2abCov(X, Y) + b²Var(Y)

Var(aX − bY) = a²Var(X) − 2abCov(X, Y) + b²Var(Y)
60
Correlation formula
Corr(X, Y) = Cov(X, Y) / [sd(X) · sd(Y)]
61
Define statistical dependence
Two random variables that have non-zero covariance or correlation are statistically dependent, meaning that knowing the outcome of one of the two random variables gives us useful information about the other.
62
Weighted value
You're multiplying each value of Y by how likely it is, and summing it all up: E[Y] = y₁·P(Y=y₁) + y₂·P(Y=y₂) + ...
63
LIE (Law of Iterated Expectations)
The overall expected value of Y is the average of the expected values of Y given X, weighted by the probability of each X: E(Y) = E(Y|X=1)P(X=1) + E(Y|X=2)P(X=2) + E(Y|X=3)P(X=3)
64
Conditional probability density function
P (y|x) = P (x and y)/P (x)
65
Conditional expectation function
E[Y|X] = y₁f(y₁|X) + y₂f(y₂|X). E.g. if, given X, Y = 1 with probability 0.20 and Y = 2 with probability 0.80, then E[Y|X] = 1 × 0.20 + 2 × 0.80 = 1.80.
66
Population parameters vs sample statistics
The difference between u and û. "Population" = the true, universal model (y = β₀ + β₁x₁ + u). "Sample" = the observable data → hats (ŷ = β̂₀ + β̂₁x₁, û). The sample mean is an ESTIMATOR of the population mean.
67
Simple linear regression formula
yᵢ = β₀ + β₁xᵢ + uᵢ, with E(uᵢ|xᵢ) = 0
68
Formula: Variance of OLS estimators
Var(β̂ⱼ) = σ² / [SSTⱼ(1 − Rⱼ²)], where SSTⱼ is the total sample variation in xⱼ and Rⱼ² is the R-squared from regressing xⱼ on the other regressors
69
Formula: Standard error of OLS estimators
se(β̂ⱼ) = √( σ̂² / [SSTⱼ(1 − Rⱼ²)] )
70
Formula: t-calc for hypothesis testing
t = (β̂₁ − β₁⁰) / se(β̂₁), where β₁⁰ is the value of β₁ under H₀ (usually 0)
71
Formula: degrees of freedom
df = n − k − 1, where:
n = sample size
k = number of slope coefficients (regressors, excluding the intercept)
q = number of restrictions (used in F-tests)
72
Formula: F-stat for joint hypothesis testing
F = [(SSRr − SSRur) / q] / [SSRur / (n − k − 1)]
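A small helper implementing this formula; the function name and the numbers in the example call are my own, purely illustrative:

```python
from scipy import stats

def f_stat(ssr_r, ssr_ur, q, n, k):
    """F = [(SSRr - SSRur)/q] / [SSRur/(n - k - 1)], p-value from F(q, n - k - 1)."""
    df = n - k - 1
    F = ((ssr_r - ssr_ur) / q) / (ssr_ur / df)
    p = 1 - stats.f.cdf(F, q, df)
    return F, p

print(f_stat(ssr_r=120.0, ssr_ur=100.0, q=2, n=100, k=4))
```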
73
Formula: R-squared
R^2 = SSE/SST = 1 − SSR/SST
74
Formula: adjusted R-squared
Adjusted R² = 1 − [SSR / (n − k − 1)] / [SST / (n − 1)]
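The two fit measures as small functions (a sketch; names and example numbers are my own):

```python
def r_squared(ssr, sst):
    return 1 - ssr / sst

def adj_r_squared(ssr, sst, n, k):
    # penalizes extra regressors via the degrees-of-freedom correction
    return 1 - (ssr / (n - k - 1)) / (sst / (n - 1))

print(r_squared(40.0, 100.0), adj_r_squared(40.0, 100.0, n=50, k=3))
```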
75
ui formula
uᵢ = yᵢ − E(yᵢ|xᵢ)
76
OLS estimator purpose
To find the best-fitting line: the OLS estimators of β₀ and β₁ are the values b₀ and b₁ which minimise the sum of squared residuals (SSR).
77
B(hat)0 OLS formula:
β̂₀ = ȳ − β̂₁x̄
78
B(hat)1 OLS formula
β̂₁ = Cov̂(x, y) / Var̂(x) = σ̂ₓᵧ / σ̂²ₓ
79
Formula: Predicted or fitted values y(hat) - OLS
ŷᵢ = β̂₀ + β̂₁xᵢ
80
Formula: Prediction errors or residuals u(hat)i - OLS
ûᵢ = yᵢ − β̂₀ − β̂₁xᵢ = yᵢ − ŷᵢ
81
SST
Total sum of squares: measure of TOTAL sample variation in y. Measures how much y varies overall (without using the model), e.g. if we only look at people's heights without considering any factors.
82
SSE
Explained sum of squares: measure of sample variation in ŷ. Measures how much of the variation in y the model explains (via X), e.g. height differences explained by age and genetics.
83
SSR
Residual sum of squares: measure of sample variation in û. Measures how much of y is not explained by the model (random error), e.g. two people of the same age and genetics having different heights. When you add more variables, SSR always decreases, regardless of the quality of the variable.
84
SST formula
SST = SSE + SSR
85
SSR formula
SSR = σ̂² × (n − k − 1), where σ̂² is the estimated error variance
86
Adjusted R squared
tells us how well a set of predictor variables is able to explain the variation in the response variable, adjusted for the number of predictors in a model.
87
what does a 95% confidence interval mean
We are 95% confident that the population parameter lies between the two values. If 0 is NOT included in the confidence interval → the x factor is STATISTICALLY SIGNIFICANT, meaning there is an effect (reject the null).
88
Formula: Confidence interval for Bj
β̂ⱼ ± t_(α/2) × se(β̂ⱼ)
89
SSR, SSE, SST formula's written in sum (Σ) form
SSR: Σ(yᵢ − ŷᵢ)²
SSE: Σ(ŷᵢ − ȳ)²
SST: Σ(yᵢ − ȳ)²
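A sketch (made-up data) verifying the decomposition SST = SSE + SSR for an OLS fit with an intercept:

```python
import numpy as np

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([1.2, 1.9, 3.2, 3.8, 5.1])

b1 = np.cov(x, y, ddof=1)[0, 1] / np.var(x, ddof=1)
b0 = y.mean() - b1 * x.mean()
y_hat = b0 + b1 * x

sst = np.sum((y - y.mean()) ** 2)
sse = np.sum((y_hat - y.mean()) ** 2)
ssr = np.sum((y - y_hat) ** 2)
print(sst, sse + ssr)  # equal: the decomposition holds for OLS with an intercept
```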
90
B(hat) matrix formula
β̂ = (X′X)⁻¹X′y
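A minimal numpy sketch of this matrix formula on simulated data (all numbers made up):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100
x1, x2 = rng.normal(size=n), rng.normal(size=n)
y = 1.0 + 2.0 * x1 - 0.5 * x2 + rng.normal(size=n)

X = np.column_stack([np.ones(n), x1, x2])    # column of 1s for the intercept
beta_hat = np.linalg.inv(X.T @ X) @ X.T @ y  # (X'X)^-1 X'y
print(beta_hat)                              # close to (1, 2, -0.5)
# In practice np.linalg.lstsq(X, y, rcond=None) is more numerically stable.
```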
91
t crit formula
t_crit comes from the t distribution with n − k − 1 degrees of freedom, where n = sample size and k = number of slope coefficients
91
t calc formula
t = (β̂₁ − β₁⁰) / se(β̂₁)
92
Hypothesis testing (8 Steps)
1. Report equations
2. Formulate null (H₀) / alternative (H₁)
3. Pick test statistic under H₀
4. Calculate calc value
5. Calculate crit value
6. Decision rule
7. Decision
8. Conclusion
93
Hypothesis testing (8 steps)
Step 1: Report equations → write the estimated regression model (incl. standard errors in brackets underneath each coefficient)
Step 2: Formulate H₀ and H₁ → state the null and alternative hypotheses clearly
Step 3: Test statistic under H₀ → choose the appropriate test statistic formula (usually a t-statistic)
Step 4: Calculate the calc value
Step 5: Calculate the crit value → find the critical value
Step 6: Decision rule → define your rejection rule (e.g. reject H₀ if |t_calc| > t_crit)
Step 7: Decision → compare your calc and crit values → reject or fail to reject H₀
Step 8: Conclusion → write a contextual interpretation in plain English
94
T-test (steps)
1. Write out the estimated equation (with standard errors in parentheses under the coefficients)
2. H₀: β₁ = 0, H₁: β₁ > 0
3. Test statistic under the null distribution: t = (β̂₁ − β₁⁰)/se(β̂₁) ~ t(n − k − 1)
4. Using a significance level of α = 0.05, compute t_calc = (β̂₁ − 0)/se(β̂₁)
5. t_crit = t_(α, n−k−1)
6. Reject the null if t_calc > t_crit
7. Since t_calc > t_crit, we reject the null hypothesis and conclude that the higher x is, the higher y is, holding all other factors constant
95
Key points when reporting regression
1. State the population model (always include i and u)
2. State the estimated model (always include i and hats, but NEVER u)
3. Report the estimated model coefficients (check decimal places, rounding, and sign)
4. Report the standard errors (check decimal places and rounding; always directly under the coefficient)
5. Report R-squared (usually to the right side or the bottom)
6. Optional significance stars (*p < 0.05, **p < 0.01, ***p < 0.001)
96
Population model regression vs Estimated model regression
population model: yᵢ = β₀ + β₁xᵢ + uᵢ

estimated model: ŷᵢ = β̂₀ + β̂₁xᵢ (no u, because it cannot be observed)
97
p-value
p-value < 0.05 → we reject H₀ (statistically significant); p-value > 0.05 → we fail to reject H₀ (statistically insignificant)
98
E(wageᵢ | femaleᵢ, educᵢ) = β₀ + δ₀femaleᵢ + β₁educᵢ, what are the implications of the dummy variable
If femaleᵢ = 0: E(wageᵢ) = β₀ + β₁educᵢ → represents men

If femaleᵢ = 1: E(wageᵢ) = (β₀ + δ₀) + β₁educᵢ → represents women

* δ₀ measures the difference in intercept between women and men (an intercept shift)
* Only one dummy variable is needed when there are only two possibilities (male/female)
99
Intercept and slope dummy
Intercept dummy (δ₀) = difference in base outcome between groups

Slope dummy (δ₁) = difference in the effect of another variable between groups (e.g. the effect of education for men vs women)
100
Slope dummy
E(wageᵢ | femaleᵢ, educᵢ) = β₀ + δ₀femaleᵢ + β₁educᵢ + δ₁(femaleᵢ × educᵢ)

* Implies:
○ For men: E(wageᵢ) = β₀ + β₁educᵢ
○ For women: E(wageᵢ) = (β₀ + δ₀) + (β₁ + δ₁)educᵢ

* δ₁ captures the difference in slope between men and women.
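A hedged sketch of fitting this intercept-plus-slope-dummy model with statsmodels' formula API; the wage data here is simulated, not a real dataset:

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

rng = np.random.default_rng(2)
n = 500
df = pd.DataFrame({
    "female": rng.integers(0, 2, n),      # dummy: 1 = female, 0 = male
    "educ": rng.integers(8, 21, n),       # years of education
})
# simulated wages with an intercept shift and a slope shift for women
df["wage"] = (5 + 1.5 * df["educ"] - 2 * df["female"]
              - 0.3 * df["female"] * df["educ"] + rng.normal(0, 2, n))

# 'female:educ' adds the interaction term (the slope dummy)
fit = smf.ols("wage ~ female + educ + female:educ", data=df).fit()
print(fit.params)  # delta0 on 'female', delta1 on 'female:educ'
```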
101
Hypothesis testing for Dummy variables
Test whether the dummy variables jointly affect the dependent variable.

* Joint hypothesis:
○ H₀: δ₀ = δ₁ = 0 (gender has no effect)
○ H₁: at least one ≠ 0 (gender does affect wage)

* Two models:
○ Unrestricted: includes dummy and interaction terms
○ Restricted: includes only non-dummy regressors

* Use an F-test for the joint hypothesis.
102
Hypothesis testing for Dummy variables (unrestricted vs restricted)
Unrestricted: wageᵢ = β₀ + δ₀femaleᵢ + β₁educᵢ + δ₁(femaleᵢ × educᵢ) + uᵢ

Restricted: wageᵢ = β₀ + β₁educᵢ + uᵢ
103
If individual t-tests on δ₀ and δ₁ (dummies) fail to reject H₀, do we conclude the investigation?
No. The joint F-test can still reject: t-tests assess each dummy in isolation, while the F-test assesses them jointly, so the results can differ.
104
Dummy variable trap
including a dummy for every category alongside the intercept, i.e. failing to omit one dummy (the base category) or to drop the intercept, which creates perfect collinearity
105
Imagine 4 industry sectors → transport, consumer products, finance, utility. transᵢ = 1 if transport consprodᵢ = 1 if consumer products financeᵢ = 1 if finance utilityᵢ = 1 if utility now testing with the log (salary) log(salaryᵢ) = β₀ + β₁financeᵢ + β₂consprodᵢ + β₃utilityᵢ + β₄log(salesᵢ) + β₅ROEᵢ + uᵢ using transport as the benchmark (omitted) What are they key points with this regression model, in terms of relationships of dummies with the benchmark and with each other? What does each beta represent?
log(salaryᵢ) = β₀ + β₁financeᵢ + β₂consprodᵢ + β₃utilityᵢ + β₄log(salesᵢ) + β₅ROEᵢ + uᵢ

- β₀ measures the average log salary in the transport industry (the base/omitted dummy)
○ β₁ = difference in log salary between finance and transport
○ β₂ = difference in log salary between consumer products and transport
○ β₃ = difference in log salary between utility and transport
- β₂ − β₁ measures the difference between the average log salary in consumer-product firms and finance firms
106
log-level model
when the dependent variable is in logs: log(ŷ) = β̂₀ + β̂₁x₁ + ... + β̂ₖxₖ

β̂₁ is the change in log(ŷ) as x₁ increases by 1 unit, all else constant. So instead of saying "when x increases by 1, y increases by β̂₁ units on average", we say y changes by about 100·β̂₁ % on average (e.g. β̂₁ = 0.04 → roughly a 4% increase).

For dummy variables, the exact % change in y due to the category is 100(e^β̂ − 1)%
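A tiny check of the exact dummy-variable formula (the coefficient value is made up):

```python
import numpy as np

beta_hat = 0.25                     # hypothetical dummy coefficient on log(y)
pct = 100 * (np.exp(beta_hat) - 1)  # exact % change vs the 100*beta ≈ 25% approximation
print(pct)                          # ≈ 28.4%
```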
107
Example of when you should use the log models in real life scenarios
Total debt and education → log level Wage and GDP → level log GDP and unemployment → log log
108
Quadratic models for log
y = β₀ + β₁x + β₂x² + u

Derivative → marginal effect of x on y: β₁ + 2β₂x

e.g. sleep vs age: sleep might go down with age, but after a point it might go up again → quadratic pattern.
108
when to use logs
- if y must be positive
- if % changes make more sense
- if data is skewed

Don't log years (age, education), % variables, or negatives.
109
log-level vs level-log vs log-log
Log-level model: log(y) = β₀ + β₁x + u → 100·β₁ ≈ % change in y when x increases by 1 unit.

Level-log model: y = β₀ + β₁log(x) + u → β₁/100 = change in y when x increases by 1%.

Log-log model: log(y) = β₀ + β₁log(x) + u → β₁ = % change in y when x increases by 1% (elasticity).

Notes:
- The R² of a regression with log(y) is not comparable to one with y in levels.
- Each tested coefficient (e.g. β₁) holds all other factors constant.
110
Interpretation of quadratic models for log
y = β₀ + β₁x + β₂x² + u

The coefficients of x and x² on their own have no meaningful interpretation. Find where the derivative equals 0: the turning point is at x* = −β₁/(2β₂); it is a maximum when β₂ < 0 and a minimum when β₂ > 0.

When in doubt about adding a quadratic term, add it and check its statistical significance, or see whether it improves the adjusted R².
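A one-line check of the turning point (coefficients made up):

```python
b1, b2 = 0.8, -0.02      # e.g. y rises with x then falls (b2 < 0 → maximum)
x_star = -b1 / (2 * b2)  # where the marginal effect b1 + 2*b2*x equals zero
print(x_star)            # 20.0
```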
111
When to use quadratic model?
1. Is the effect of x on y constant, or does it change over x?
2. Is there a peak/optimal x for y? (e.g. wage and age)
112
Model selection criteria
R^2, Adjusted R^2, or Information criteria (AIC, HQ, BIC)
113
R^2 (model selection)
R² = SSE/SST = 1 − SSR/SST. SSR never goes up when regressors are added, so R² is not reliable when models differ in the number of regressors.
114
Adjusted R^2 (model selection)
Adjusted R² = 1 − [SSR / (n − k − 1)] / [SST / (n − 1)]. Penalizes adding too many variables; only goes up if adding a variable really helps.
115
Information Criteria (IC)
All ICs balance two things:
1. How well the model fits (low SSR = good)
2. How simple the model is (fewer variables = good)

General formula: IC = c + ln(SSR) + P(k)/n. Adding regressors lowers SSR but increases the penalty. We prefer the model with the LOWEST IC value and HIGHEST adjusted R².
116
AIC
AIC = c₁ + ln(SSR) + 2k/n
117
HQ
HQ = c₂ + ln(SSR) + 2k·ln(ln(n))/n
118
SIC/BIC
BIC = c₃ + ln(SSR) + k·ln(n)/n
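A sketch comparing the three criteria; the constants c₁, c₂, c₃ are dropped since they don't affect model rankings, and the SSR/k values are made up:

```python
import numpy as np

def aic(ssr, k, n): return np.log(ssr) + 2 * k / n
def hq(ssr, k, n):  return np.log(ssr) + 2 * k * np.log(np.log(n)) / n
def bic(ssr, k, n): return np.log(ssr) + k * np.log(n) / n

n = 100
for k, ssr in [(2, 55.0), (3, 50.0), (4, 49.5)]:            # candidate models
    print(k, aic(ssr, k, n), hq(ssr, k, n), bic(ssr, k, n))  # pick the lowest
```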
119
Multiple linear regression form
Stacking all n observations, y = β₀ + β₁x⁽¹⁾ + ⋯ + βₖx⁽ᵏ⁾ + u (β₀ adds to every row, i.e. multiplies a column of ones):

| y1 |        | x11 |          | x1k |      | u1 |
| y2 | = β₀ + | x21 | β₁ + ⋯ + | x2k | βₖ + | u2 |
| :  |        | :   |          | :   |      | :  |
| yn |        | xn1 |          | xnk |      | un |
120
3 features of the OLS matrix notation
1. R² = 1 − SSR/SST

2. β̂ = (X′X)⁻¹X′y, and in simple regression β̂₁ = Cov̂(x, y)/Var̂(x). Note the difference between the population parameter β and its OLS estimator β̂: β is constant and does not change, while β̂ is a function of the sample and its value changes for different samples.

3. X′û = 0: the vector of residuals is orthogonal to every column of X (i.e. their products are zero).
121
What does it mean when the OLS estimator is BLUE
'Best Linear Unbiased Estimator' of β: among all linear unbiased estimators, OLS has the smallest variance.
122
What does unbiased estimator mean
β̂ is an unbiased estimator of the parameter of interest if its EXPECTED VALUE equals the PARAMETER OF INTEREST:

E(β̂) = β, or in matrix form E(β̂) = E[(X′X)⁻¹X′y] = β

β̂ = estimator, β = parameter of interest
123
5 Assumptions (for unbiasedness and valid inference)
A1: The population model is linear in parameters: y = Xβ + u
A2: The columns of X are linearly independent (no perfect collinearity)
A3: The conditional mean of the errors is zero: E(u|X) = 0
A4: Homoskedasticity and no serial correlation: Var(u|X) = σ²Iₙ
A5: The errors are normally distributed: u|X ~ N(0, σ²Iₙ)

If these assumptions do not hold, the null distribution and t statistics are no longer reliable.
124
Law of iterated expectations (LIE)
E(Y) = E_X[E(Y | X)]
125
The OLS estimator is unbiased under 3 assumptions (shown via the LIE)
E(β̂) = E[(X′X)⁻¹X′y] = β, given:
A1: The population model is linear in parameters: y = Xβ + u
A2: The columns of X are linearly independent
A3: The conditional mean of the errors is zero: E(u|X) = 0
126
Variance of the OLS estimator
Var(β̂) = σ²(X′X)⁻¹. Choose the estimator with the smallest variance.
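Continuing the numpy sketch from the β̂ card: estimating σ² and the variance matrix (simulated data, my own illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 100
x1 = rng.normal(size=n)
y = 1.0 + 2.0 * x1 + rng.normal(size=n)
X = np.column_stack([np.ones(n), x1])

beta_hat = np.linalg.inv(X.T @ X) @ X.T @ y
resid = y - X @ beta_hat
k = X.shape[1] - 1                        # number of slope coefficients
sigma2_hat = resid @ resid / (n - k - 1)  # sigma^2 estimate = SSR / (n - k - 1)
var_beta = sigma2_hat * np.linalg.inv(X.T @ X)
print(np.sqrt(np.diag(var_beta)))         # standard errors of beta_hat
```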
127
Covariance of the OLS estimator
The matrix contains the variances of each β̂ⱼ on the diagonal and the covariances between β̂ⱼ and β̂ₖ off the diagonal. For the 2×2 case:

| Var(β̂₀)       Cov(β̂₀, β̂₁) |
| Cov(β̂₁, β̂₀)  Var(β̂₁)      |
128
Sampling distribution of the OLS estimator
β̂ ~ N(β, σ²(X′X)⁻¹)
129