Correlation and Partial Correlation Flashcards
Bivariate linear correlation
- examines the relationship between two variables
what can bivariate relationships vary in:
- form (linear, curvilinear)
- direction (positive / negative)
- magnitude/strength
magnitude/strength in bivariate relationships
- r = -1 : perfect negative relationship
- r = +1 : perfect positive relationship
- r = 0 : no relationship
positive or negative correlation
correlation does not mean causation
strength of correlation : strong negative/positive
+/- 0.9, 0.8, 0.7
strength of correlation: moderate negative/positive
+/- 0.6, 0.5, 0.4
strength of correlation: weak negative/positive
+/- 0.3, 0.2, 0.1
correlation hypothesis testing
- linear correlation involves measuring the relationship between two variables measured in a sample
- we are more interested in if there is a relationship in equivalent population variables
- use sample statistics to estimate population parameters
- H0: there is no relationship between the population variables
p-value in correlation
what is the chance of measuring a relationship of that magnitude when the null hypothesis is true?
- reject null if p < .05
parametric assumptions
- both variables should be CONTINUOUS (if not use non-parametric alternative)
- related PAIRS: each participant should have a pair of values
- absence of outliers
- linearity: point sin scatterplot should be best explained with a STRAIGHT line
- sensitive to range restrictions: floor and ceiling effects
non-parametric alternative
- if assumptions violated
- Spearman’s rho (or Kendall’s Tau if fewer than 20 cases)
(or fewer than 7 points on a likert scale you use one of these)
floor effect
cluster of scores at bottom of scale
- form of range restriction
ceiling effect
clustering of scores at top of scale
- form of range restriction
PPMCC
pearson product-moment correlation coefficient
what does Pearson’s correlation coefficient investigate
the relationship between two quantitative, continuous variables
what does Pearson’s produce
a correlation coefficient ‘r’ which is a measure of the strength of association between the two variables
Covariance
- for each data point, calculate the difference from the mean of x, and the difference from the mean of y
- multiply the differences
- sum the multiplied differences
- divide by N -1
what does covariance do
- provides a measure of the variance shared between x and y variables
correlation coefficient and covariance
- ‘r’ is a ratio of covariance (shared variance) to separate variances
- we can obtain a measure of separate variances by multiplying the standard deviation for x and y
- DON’T NEED TO DO THIS BY HAND
correlation coefficient strength
- ‘r’ is a ratio
- if covariance is large relative to separate variances, r will be further from 0
- if covariance is small relative to separate variances, r will be closer to 0
what can r represent
it can tell us how well a straight line fits the data point i.e. the strength of correlation
- if data points cluster around the line, r is further from 0
- if data points scattered around line, r is closer to 0
SPSS output for correlation
- r tells you strength
- p tells you if correlation is significant
- N helps you calculate d.f.
degrees of freedom for r
N - 2
- report when reporting r
e.g. r(23) = .522, p = .007
sampling error
- r value obtained from another sample from the same population would likely be different
- reflects sampling error