Module 10.1: Linear Regression Flashcards

(61 cards)

1
Q

What is the purpose of simple linear regression?

A

To explain the variation in a dependent variable in terms of the variation in a single independent variable.

Variation is interpreted as the degree to which a variable differs from its mean value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

How is variation defined in the context of regression?

A

The degree to which a variable differs from its mean value.

Variation should not be confused with variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the dependent variable in linear regression?

A

The variable whose variation is explained by the independent variable.

It is also referred to as the explained variable, endogenous variable, or predicted variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the independent variable in linear regression?

A

The variable used to explain the variation of the dependent variable.

It is also referred to as the explanatory variable, exogenous variable, or predicting variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Fill in the blank: The dependent variable is also referred to as the _______.

A

explained variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Fill in the blank: The independent variable is also referred to as the _______.

A

explanatory variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

True or False: Variation and variance are the same concepts.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What question does simple linear regression aim to answer?

A

“What explains fluctuations in the dependent variable?”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the role of GDP in predicting stock returns?

A

GDP is the independent variable, while stock returns are the dependent variable.

Stock returns are explained by GDP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Fill in the blank: The dependent variable is explained by the _______.

A

independent variable

In this context, stock returns are explained by GDP.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the relationship between independent and dependent variables?

A

The independent variable explains the variation in the dependent variable.

In this case, GDP explains stock returns.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does Yi represent in the linear regression model?

A

ith observation of the dependent variable, y

Yi is the value we are trying to predict or explain.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What does Xi represent in the linear regression model?

A

ith observation of the independent variable, X

Xi is the predictor variable used to explain variations in Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is b0 in the context of linear regression?

A

regression intercept term

bo is the value of Y when X is zero.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does bi denote in the linear regression model?

A

regression slope coefficient

bi indicates the change in Y for a one-unit change in X.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the significance of the term ei in the linear regression model?

A

residual for the ith observation (also referred to as the disturbance term or error term)

&i accounts for the difference between the observed and predicted values of Y.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the main purpose of the regression process in this model?

A

estimates an equation for a line through a scatter plot of the data that ‘best’ explains the observed values for Y in terms of the observed values for X

This involves minimizing the sum of the squared residuals.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Fill in the blank: In the linear regression model, Yi = b0 + bi Xi + _____

A

ei

This term represents the residual for each observation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the form of the linear equation often called the line of best fit?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What does the hat ‘^’ above a variable indicate?

A

Predicted value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the regression line?

A

The line that minimizes the sum of the squared differences between predicted and actual Y-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

What is the sum of squared errors (SSE)?

A

The sum of the squared vertical distances between estimated and actual Y-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

True or False: The regression line is the only line that can be drawn through a scatter plot of X and Y.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Fill in the blank: The regression line minimizes the sum of the squared differences (vertical distances) between the _______ predicted by the regression equation and the actual Y-values.

A

Y-values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
What does the regression line minimize?
The SSE ## Footnote SSE stands for Sum of Squared Errors.
26
What is another name for simple linear regression?
Ordinary least squares (OLS) regression
27
What does the estimated slope coefficient describe?
The change in Y for a one-unit change in X
28
How can the slope term be positive, negative, or zero?
It depends on the relationship between the regression variables
29
How is the slope term calculated?
30
What does the intercept term represent?
The line's intersection with the Y-axis at X = 0
31
Can the intercept term be positive, negative, or zero?
Yes
32
How can the intercept term be expressed using the means of Y and X?
33
What coordinates does the regression line pass through?
The mean of the independent and dependent variables (i.e., the point Xbar , Ybar)
34
What does the estimated intercept represent in regression analysis?
The value of the dependent variable at the point of intersection of the regression line and the vertical axis.
35
When is the intercept an estimate of the dependent variable?
When the independent variable is zero.
36
What does the estimated slope coefficient indicate?
The expected change in the dependent variable for a one-unit change in the independent variable.
37
If the estimated slope coefficient is 2, how much is the dependent variable expected to change for a one-unit change in the independent variable?
By two units.
38
What is the slope coefficient in a regression of excess returns on market returns called?
The stock's beta ## Footnote Beta is an estimate of the systematic risk of a stock.
39
What does a beta less than 1 indicate about a stock's risk?
Less risky than the average stock ## Footnote This means the stock's returns tend to change less than the overall market.
40
What does a beta of 1 signify?
Average level of systematic risk ## Footnote A stock with this beta moves in line with the market.
41
What does a beta greater than 1 indicate?
More-than-average systematic risk ## Footnote This means the stock's returns are more volatile than the market.
42
What must be assessed to determine the importance of an independent variable in explaining a dependent variable?
Statistical significance of the slope coefficient ## Footnote This involves hypothesis testing or forming a confidence interval.
43
What is the first assumption of linear regression?
A linear relationship exists between the dependent and the independent variables.
44
What does homoskedasticity refer to in linear regression?
The variance of the residual term is constant for all observations.
45
What does it mean for the residual term to be independently distributed?
The residual for one observation is not correlated with that of another observation.
46
How is the normality of the residual term defined in linear regression?
The residual term is normally distributed.
47
When is a linear regression model not appropriate?
When the underlying relationship between X and Y is nonlinear.
48
What is a method to check for linearity in a regression model?
Examine the model residuals in relation to the independent regression variable.
49
Fill in the blank: The residual term in linear regression must be _______.
normally distributed.
50
True or False: In linear regression, the paired x and y observations are dependent on each other.
False.
51
What does homoskedasticity refer to?
The case where prediction errors all have the same variance. ## Footnote Homoskedasticity is an important assumption in regression analysis.
52
What is heteroskedasticity?
The situation when the assumption of homoskedasticity is violated. ## Footnote Heteroskedasticity can lead to inefficient estimates in regression models.
53
What is a type of heteroskedasticity related to the error term?
The variance of the error term changes over time.
54
What does it indicate if the magnitude of errors exhibits a pattern of changing over time?
It indicates heteroskedasticity.
55
What is the implication of large prediction errors observed in monthly sales data?
It suggests a lack of independence in the relationship between the variables.
56
What does it mean if the residuals are not independent?
It means our estimates of the model parameters' variances will not be correct.
57
True or False: Independence of the relationship between X and Y ensures that residuals are independent.
True
58
What does it indicate when the residuals are normally distributed? We can conduct hypothesis testing for evaluating the goodness of fit of the model when the residual are ____________.
Normally distributed
59
What is the central limit theorem's role regarding parameter estimates, when the residuals are not normally distributed?
With a large sample size, parameter estimates may be valid even when the residuals are not normally distributed. ## Footnote This theorem supports the robustness of statistical inference under certain conditions.
60
What are outliers in the context of regression analysis?
Observations that are far from our regression line, characterized by large prediction errors or X values that are far from others. ## Footnote Outliers can skew the results and affect the accuracy of the model.
61
What are the values determined by the estimated regression equation called?
Least squares estimates