Tests of Relationships- Correlation; Regression Flashcards
(36 cards)
The questions below test _________ or __________.
- Is group A different from group B?
- Does this treatment cause this outcome?
Difference or proportions depending on the outcome variable type.
- Test difference for _________ outcomes.
- Test proportions for ________ outcomes.
- continuous
- categorical
The questions below test _____________.
- What is the relationship between A and B?
- Does variable A increase with variable B?
relationship
Examples:
Which tests would be appropriate for the following questions?
- ) Is the male group different from the female group by their BMI level?
- ) Is the gender linked to a certain age group?
- ) What is the relationship between BMI and DBP at baseline?
- ) Test the difference of the mean BMI between groups of male and female.
- ) Compare the proportions of being older than 50 yrs between the groups of male and female.
- ) Test relationship between DBP and BMI.
Tests of Differences (parametric vs non-parametric):
-What 2 tests are used for two group independent comparison? Which is used for parametric data? Which is used for non-parametric data?
- What 2 tests are used multi group independent comparison? Which is used for parametric data? Which is used for non-parametric data?
- What 2 tests are used two group paired comparison? Which is used for parametric data? Which is used for non-parametric data?
- What 2 tests are used for multi group paired comparison? Which is used for parametric data? Which is used for non-parametric data?
- Independent t-test, Mann-Whitney U Test
- Independent t-test = parametric
- Mann-Whitney U Test = non-parametric
- ANOVA, Kruskal-Wallis H Test
- ANOVA = parametric
- Kruskal-Wallis H Test = non-parametric
- Paired t-test, Wilcoxon Signed Rank Test
- Paired t-test = parametric
- Wilcoxon SIgned Rank Test = non-parametric
- Repeated Measures ANOVA, Friedman Test
- Repeated Measures ANOVA = parametric
- Friedman Test = non-parametric
Tests of Proportions (parametric vs non-parametric):
-What 2 tests are used for two group independent comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
- What 2 tests are used for multi group independent comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
- What 2 tests are used for two group paired comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
- What 2 tests are used for multi group paired comparison? Which is used for data that isn’t sparse? Which is used for sparse data?
- Chi-Square, Fisher’s Exact
- Chi-Square = data not sparse
- Fisher’s Exact = data sparse
- Chi-Square, Fisher’s Exact
- Chi-Square = data not sparse
- Fisher’s Exact = data sparse
- McNemar Test, McNemar Exact Test
- McNemar Test = data not sparse
- McNemar Exact Test = data sparse
- Stuart-Maxwell Test, Generalized Stuart-Maxwell
- Stuart-Maxwell Test = data not sparse
- Generalized Stuart Maxwell = data sparse
Tests of Cerrelation; Regression:
1
- Correlation is when you look at the ____________ between two variables.
- Draw a ______________ to visualize
- Compute a ________________ to quantify
- relationship
- scatter plot
- correlation coefficient (r)
_____________ is a way of visualizing the relationship between two variables. Each point represents the intersection of a pair of related observations. They can visually clarify the _______ and shape of a relationship.
- Scatter plot
- strength
________ _________ is a decimal number in the range of -1 to +1 and is a measure of linear relationships between two variables.
Correlation coefficient (r)
- What is a perfect positive correlation value?
- What is a perfect negative correlation value?
+1
-1
With correlation coefficient:
- The sign of r indicates __________.
- The absolute value of r indicates __________.
- direction
- strength
- How do we interpret correlation coefficient (r) values?
- Should these values be used as strict cutoff points? Why or why not?
- 0-0.25 = little or no relationship
- 0.25-0.50 = fair
- 0.50-0.75 = moderate to good
- 0.75-1 = good to excellent
-No, because they are affected by sample size, measurement error, and the types of variables beinig studied.
Correlation coefficient is a measure of ________ relationship only. It cannon be used for _____________ relationship.
- linear
- curvilinear
What are 4 types of correlation coefficients?
- Pearson (Product-Moment)
- Spearman Rank
- Phi
- Point Biserial
- _________ Correlation Coefficient is the most commonly reported measure of correlation.
- It is appropriate when X and Y are __________ variables with underlying ________ distributions.
- Pearson
- continuous, normal
- __________ Correlation Coefficient is a __________ analog of the Pearson (r).
- It is appropriate for use when X and Y are ________ variables.
- Spearman Rank, non-parametric
- ordinal
- _________ Correlation Coefficient is a special case of the Pearson (r), given only ____ values of X and Y.
- It is appropriate for use when both X and Y are ___________ variables.
- Phi, two
- dichotomous
- _____________ Correlation Coefficient is a special case of the Pearson (r).
- It is appropriate when a ___________ X is correlated with a ___________ variable Y.
- Point Biserial
- dichotomous X, continuous Y
What are some precautions in using Correlation Coefficient?
Nonlinearity
-A strong curvilinear relationship may be identified as no correlation.
Outlier
-A single point can have a large influence on the correlation.
Interpretation is subjective
-No hard and fast rules that determine an (r) value is strong, moderate, or weak
-Interpretation should be based on the nature of the data, purpose of the research, and the researcher’s knowledge of the subject matter
Causation
-No causal relationship can be determined based on the (r) value
-Correlation of A and B is the same as correlation of B and A
-Correlation cannot be used to establish a cause-and-effect situation.
- Regression is when you look at the relationship between two variables in a ______________ situation.
- Draw a _________ with a _________ line to visualize.
- Compute a ________________ to quantify.
- cause-and-effect
- scatter plot, regression line
- coefficient of determination (R²)
___________ is used to predict values of one variable from another:
-(X -> Y) where X is the ___________ (predictor/explanatory) variable, and Y is the ___________ (outcome/response) variable.
- Regression
- independent,dependent
What are the 2 types of regression?
- Linear
- Logistic
- __________ Regression is used to examine the causal relationship of the two variables, X and Y, that are linearly related.
- Fits a regression line Y=a+bX through the points and estimate a and b
Linear