Week 8 Flashcards

1
Q

What does correlation tell us?

A

The strength and direction of the linear association between two variables

2
Q

What is the regression equation used to do?

A

Express the relationship between 2 or more variables, therefore allowing us to estimate one variable on the basis of another

3
Q

What is testing for statistical significance of the regression slope and how is it done?

A

Tests if regression coefficient is significantly different from 0.

1) Find r and b
2) Set hypotheses (H0: β = 0, H1: β ≠ 0)
3) Follow the usual hypothesis-testing steps, using the t distribution

t = (b-0)/s

Where s is standard error of slope estimate
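
The calculation above can be sketched in Python. This is a minimal illustration for one independent variable, not a definitive implementation: the function name slope_t_statistic and the sample data are made up for the example, and s is computed with the usual simple-regression formula sqrt((SSE / (n - 2)) / Σ(x - xbar)^2).

```python
import math

def slope_t_statistic(x, y):
    """Return (b, s, t) for the simple regression y = a + b*x.

    Hypothetical helper for illustration only.
    """
    n = len(x)
    x_bar = sum(x) / n
    y_bar = sum(y) / n
    # Sum of squares of x and cross-products of x and y
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b = sxy / sxx                # slope estimate
    a = y_bar - b * x_bar        # intercept estimate
    # Sum of squared residuals around the fitted line
    sse = sum((yi - (a + b * xi)) ** 2 for xi, yi in zip(x, y))
    # Standard error of the slope, with n - 2 degrees of freedom
    s = math.sqrt(sse / (n - 2) / sxx)
    t = (b - 0) / s              # test statistic under H0: beta = 0
    return b, s, t

# Made-up sample data
x = [1, 2, 3, 4, 5]
y = [2, 4, 5, 4, 5]
b, s, t = slope_t_statistic(x, y)
print(b, s, t)
```

The resulting t is then compared with the critical value from the t distribution with n - 2 degrees of freedom.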

4
Q

3 methods to tell us coefficient is significantly different from 0?

A

Use confidence intervals for coefficient
Statistical test using t-dist.
Use p-value

5
Q

What is the p-value?

A

The smallest level of α at which the null hypothesis would be rejected

Same as:

Probability, assuming H0 is true, of obtaining a test statistic at least as extreme as the one observed

6
Q

What is the decision rule in hypothesis testing when only given p-value and α?

A

Reject null hypothesis if p-value < α

7
Q

What does the coefficient of determination measure?

A

It measures the proportion of total variation in the dependent variable (y) that is explained by the variation in the independent variable (x)

8
Q

How to calculate R^2 if there is only one independent variable?

A

R^2 = r^2

9
Q

How to calculate R^2 if more than one IV?

A

R^2 = SSR/SStotal = 1 - SSE/SStotal

SStotal = Σ(Y-Ybar)^2
SSR = Σ (Yhat-Ybar)^2
SSE = Σ (Y-Yhat)^2
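
As a rough check of the sums of squares above, the sketch below computes R^2 both ways. The function name r_squared and the data are made up for illustration; the fitted values are assumed to come from a least-squares fit with an intercept, which is the case where SSR/SStotal and 1 - SSE/SStotal agree.

```python
def r_squared(y, y_hat):
    """Return R^2 computed as SSR/SStotal and as 1 - SSE/SStotal.

    Hypothetical helper for illustration only.
    """
    y_bar = sum(y) / len(y)
    ss_total = sum((yi - y_bar) ** 2 for yi in y)          # total variation
    ssr = sum((fi - y_bar) ** 2 for fi in y_hat)           # explained variation
    sse = sum((yi - fi) ** 2 for yi, fi in zip(y, y_hat))  # unexplained variation
    return ssr / ss_total, 1 - sse / ss_total

# Made-up observations and least-squares fitted values (Yhat = 2.2 + 0.6x)
y = [2, 4, 5, 4, 5]
y_hat = [2.8, 3.4, 4.0, 4.6, 5.2]
r2_from_ssr, r2_from_sse = r_squared(y, y_hat)
print(r2_from_ssr, r2_from_sse)
```
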
10
Q

What does the regression output give?

A

An estimate of the joint significance of all variables.

11
Q

How to test goodness of fit? (Which test?)

A

F test
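
A minimal sketch of the F statistic, assuming the standard form F = (SSR / k) / (SSE / (n - (k+1))), where k is the number of IVs and n the sample size. The function name f_statistic and the input values are made up for illustration.

```python
def f_statistic(ssr, sse, n, k):
    """Return F = (SSR / k) / (SSE / (n - (k + 1))).

    Hypothetical helper for illustration only.
    """
    msr = ssr / k                # mean square due to regression
    mse = sse / (n - (k + 1))    # mean square error (residual)
    return msr / mse

# Made-up sums of squares from a regression with n = 5 and k = 1
f_stat = f_statistic(ssr=3.6, sse=2.4, n=5, k=1)
print(f_stat)
```

The result is compared with the critical value of the F distribution with k and n - (k+1) degrees of freedom.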

12
Q

3 ways to analyse real data regression output?

A

Look at (+/-) sign and significance of the coefficients
R^2
F statistic

13
Q

Different parts of multiple regression equation? (5)

A

y = a + b1x1 + b2x2 + ... + bkxk

x = independent variable
b = regression coefficient
a = y-intercept
k = number of independent variables
y = dependent variable
14
Q

Equation used to test the significance of coefficients, and its degrees of freedom (DofF)?

A

t = (b-0)/s

DofF = n - (k+1)

n = number sampled
k = number IVs
+1 is to account for constant ‘a’

T-dist. test

15
Q

What are control variables and why?

A

Variables put in for the purpose of excluding possible alternative explanations for significant relationships between y and the variable(s) of interest.

Including CVs in the regression equation lets us analyse the relationship between the variable of interest and y while holding the CVs constant.

16
Q

How to calculate size of sample from ANOVA table?

A

Use n - (k+1) = Residual error DofF and rearrange to find n
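
The rearrangement gives n = residual DofF + (k+1). A tiny sketch, with a made-up function name and made-up ANOVA values:

```python
def sample_size(residual_df, k):
    """Recover n from the residual degrees of freedom: n = residual_df + (k + 1).

    Hypothetical helper for illustration only.
    """
    return residual_df + (k + 1)

# Made-up ANOVA table values: residual DofF = 46, k = 3 IVs
n = sample_size(residual_df=46, k=3)
print(n)
```
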

17
Q

What are the hypotheses for testing the significance of a coefficient and what do they mean?

A

H0: β=0
H1: β not equal to 0

If we fail to reject H0, the relevant IV is deemed ‘useless’ and so the variable should be excluded from the equation