Section C.2: Regression analysis Flashcards

1
Q

What is regression analysis?

A

Regression analysis relates to the creation of a “predictive model”, a formula which will allow you to predict the values of a dependent/response variable based on the independent/predictor variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describe the AIM of linear regression analysis

A

The aim of linear regression analysis is to establish a linear relationship (given as a mathematical formula as in, y=mx+c) between the predictor variables (IV) and the response variable (DV), so that we can use this formula to estimate the value of the response variable, when only the predictors’ values are known.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

The predictor variable is

Independent or dependent?
y-axis or x-axis?

A

The predictor variable is the independent variable plotted on the x-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

The response variable is

Independent or dependent?
y-axis or x-axis?

A

The response variable is the dependent variable, and is plotted on the y-axis

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the dependent variable?

A

The dependent variable is the variable that may be affected by the independent variable, and is plotted on the y-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the independent variable?

A

The independent variable is the variable that may affect the dependent variable/s, and is plotted on the x-axis.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is a residual in regression analysis?

A

A residual is the difference between the actual value of the response variable and the predicted value generated by the model for any given point.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the R-Squared value in a regression model?

A

The R-Squared value (or coefficient of determination) is a statistical measure in a regression model that determines the proportion of variance in the dependent variable that can be explained by the independent variable, or how well the data fits the regression model (goodness of fit).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What characteristics must data have to be fit for linear regression analysis?

A

Numerical, normal, outlier-free, and correlated.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is a correlation coefficient in regression analysis?

A

A correlation coefficient is a statistical measure that suggests the level of linear dependence between the two variables

High correlation is any correlation coefficient -0.5 < x < 0.5
Low correlation is any correlation coefficient -0.2 < x < 0.2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

R-Squared value shows how much variation within the data is explained by the model, what R-Squared value is considered good for prediction?

A

More than 70% or (0.7) R-Squared is considered good for prediction.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the line of best fit in a regression model?

A

The line of best fit, or regression line, is the slope of the linear model. It represents the expected value of the response variable as predicted by the model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What assumptions are made in multiple linear regression? (4)

A

Assumptions of multiple linear regression include:
1. Normality
2. No outliers
3. Constant variance
4. Independence

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is multiple linear regression?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What are the two types of linear regression? (2)

A

Simple linear regression ( 1 IV, 1 DV)
Multiple linear regression ( 2+ IV, DV)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly