Chapters Beyond 9 Flashcards
(46 cards)
What does the mean squares represent?
The between group variability and the within group variability.
Between is on top
Within is on bottom
When do we do in pairwise comparisons?
When we reject H_o in an ANOVA and want to see which means are different between groups.
Why do we do pairwise comparisons?
To be able to see which groups have different means
What does it mean if two variables measured on the same subject are associated?
Knowing one value of the variable tells us something about the value of the other variable.
What do we use to show the correlation coefficient, and what are its units?
r.
r has no units
What is the parameter for the correlation coefficient?
Rho (p)
r = 1 corresponds to what correlation?
A perfect positive correlation.
r = -1 corresponds to what correlation?
A perfect negative correlation.
r = 0 signifies what?
No correlation. No linear association.
What happens to the scatter plot when there is a strong linear association?
The points are tightly clustered around a line.
What is the explanatory variable?
X
Which is the response variable?
Y
Does r change if we interchange the explanatory and response variables?
No
When there is a strong linear association, what do we know?
That information about one variable helps in predicting the other.
What happens in a weak association?
The points are scattered broadly
What does a low r mean?
It means there is no linear association - however not necessarily that there is no association
Correlation does not imply _______
Causation
What is linear regression used for?
To find a line that summarizes the linear relationship between two variables.
With it we can make predictions about y, the response variable
How do we notate the regression line?
y_i = beta_o + beta_1x_i + epsilon_i
beta_o = intercept beta_1 = slope epsilon_i = error term. Indicates how far y_i is from the line.
What is beta_o?
Intercept
Represents the average value of y when x is zero.
What is beta_1?
Slope
Represents the change in the average for y for every one unit increase in x
What is epsilon_i?
error term. Indicates how far y_i is from the line.
We estimate the slope and intercept of the regression line from the data to get:
y_hat i = beta_hat o + beta_hat i * x
You don’t want to use a regression line to estimate values that are…
Outside of the range of data we got in our sample. (This is known as extrapolation)