R4 - Quant - Introduction To Linear Regression Flashcards

1
Q

The 6 Assumptions of the Linear Regression Model

A

1) The relationship between the dependent variable Y, and the independent variable X is linear in the parameters b0 and b1. This requirement means that b0 and b1 are raised to their first power only and that neither b0 nor b1 is multiplied or divided by another regression parameter (as in b0/b1) The requirement does not exclude X from being raised to a power other than 1
2) The independent variable X is not random
3) The expected value of the error term is 0
4) The variance of the error term is the same for all observations.
5) The error term E is uncorrelated across observations. Consequently E(ei ej) = 0 for all i that are not equal to j)

►6) The error term, E, is normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

SEE

(Standard Error of Estimate)

A

This is the standard deviation of the distribution of the Error Term

The smaller the SEE the more acurate the regression

B0 & b1 are estimated so we lose 2 degrees of freedom hence having n -2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Error Term

A

The portion of the dependent variable that is not explained by the independent variable(s) in the regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

R2

Coefficient of Determination

(Single Independent Variable)

A

Measures the fraction of the total variation in the DV that is explained by the IV

if Only 1 IV square the correlation between DV & IV

r = PXY =

Thus if r = .9203 then r2 = 0.8470

Thus the IV explains 84.7% of the variation in Y

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

R2

Coefficient of Determination

(Multiple Independent Variable)

A

Measures the fraction of the total variation in the DV that is explained by the IV

R2 = 1 -

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Confidence Invterval for a regression coefficient

A

An interval of values that is believed to include the true parameter value of b1 with a given degree of confidence.

Smaller Standard Deviation = tighter confidence Interval

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Hypothesis Testing

A

Level 2 only does T statistics

=

T = Test statisitic (Typically tested at µ = 5%

At this level the critical value is 1.96

Therefore anything with a T- Statistic of less than 1.96 can be rejected.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Dependent Variable

A

The variable whose variation around the mean is to be explained by the regression.

The left hand side variable in a regression equation.

Usually called Y

y=f(x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Independent Variable

A

A variable used to explain the dependent variable in a regression;

a right-hand-side variable in a regression equation.

Usually called X

y=f(x)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

P Value

A

The smallest level of significance at which the null value can be rejected

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

SSE

A

Sum of Squared Errors or residuals

Also known as residual sum of Squares

(The Unexplained)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

RSS

A

Regression Sum of Squares

This value is the amount of total variation in Y that is explained in the regression equation.

(The Explained)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

TSS

A

Total Sum of the Squares

TSS = SSE + RSS

TSS = The Unxplained + The Explained
(SSE) (RSS)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

ANOVA

A

ANalysis Of VAriance

Determine the usefulness of the IVs in explaining the variance in the DV

ANOVA provides the inputs for an F-Test of the sig

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Estimated parameters

A

With reference to a regression analysis, the estimated values of the population intercept and population slope coefficient(s) in a regression.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Fitted parameters

A

With reference to a regression analysis, the estimated values of the population intercept and population slope coefficient(s) in a regression.

17
Q

Linear regression

A

Regression that models the straight-line relationship between the dependent and independent variable(s).

18
Q

Regression coefficients

A

The intercept and slope coefficient(s) of a regression.

19
Q

ANOVA F Statistic

A
20
Q

(3) Limitations of Linear Regression

A
  1. Regression relations can change over time
    e.g. Parameter instability
    Sensitive to:
    - Time Period Selected
    - Sampling from more than 1 population
  2. What works only works if it is kept secret
    - Public knowledge may negate usefulness
  3. Output dependent on regression assumptions
    - tests can be performed on the €
    - typically not unequivocal
21
Q

Parameter instability

A

The problem or issue of population regression parameters that have changed over time.

22
Q

Sample Variance
of the Dependent Variable

A

or