Intro to Biostatistics Flashcards

(73 cards)

1
Q

Dependent Variable

A

Outcome you are measuring or looking for

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Independent Variable

A

What is manipulated/changed during an experiment or study

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Null Hypothesis (H0)

A

States there will be no true difference between the groups being compared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Alternative Hypothesis (H1)

A

States there will be a true difference between the groups being compared

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Nominal Grouping

A

Dichotomous/binary; non-ordered, named categories; no order or magnitude, no consistency of scale or equal distances; simply labeled variables without quantitative characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Ordinal Grouping

A

Ordered, rank-able categories; non-equal distance; they have order/magnitude but have NO consistency of scale or equal distances

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interval/Ratio Grouping

A

Ordered, magnitude, and equal distances/units; have order/magnitude AND consistency of scale/equal distances

Ex: Living siblings (number) and personal age (in years)

Interval: Arbitrary zero value (but 0 doesn’t mean absence)

Ratio: Absolute rational zero value (0 DOES mean absence of measurement value)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which groups are considered “discrete” data?

A

Nominal and Ordinal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Which groups are considered “continuous” data?

A

Interval/Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Mode

A

Most common number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Median

A

Middle number after numbers are placed in order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mean

A

Average of all numbers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Minimum, Maximum, Range

A
Minimum = lowest
Maximum = highest
Range = difference between min and max
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Variance

A

Average of the squared differences in each individual measurement value (x) and the groups’ mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Standard Deviation

A

Square root of variance value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When a dataset is normally distributed, which values are equal or near equal?

A

Mean and Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

1 standard deviation is what percentage under the curve?

A

68%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

2 standard deviations is what percentage under the curve?

A

95%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

3 standard deviations is what percentage under the curve?

A

99.7%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Positively Skewed

A

When mean is higher than median; tail is pointing to the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Negatively Skewed

A

When mean is lower than the median; tail is pointing to the left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Kurtosis

A

Measure of the extent to which observations cluster around the mean; for a normal distribution, the value of the kurtosis statistic is 0

Positive Kurtosis = more cluster
Negative Kurtosis = less cluster

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Skewness

A

Measure of the asymmetry of a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

Required assumptions of Interval/Ratio data for the proper selection of a parametric test

A
  1. Normally distributed
  2. Equal variances (multiple tests available for equal variances between groups)
  3. Randomly-derived and independent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Levene's Test
Test that tells us if data is normally distributed or not and if it has equal variances
26
How do you handle data that is NOT normally-distributed?
Use a statistical test that does not require the data to be normally distributed, such as ordinal or nominal tests, or transform data to a standardized value with the hope that this transformation allows data to be normally-distributed (may not work)
27
Type 1 Error
NOT accepting the null hypothesis when it is actually true and should have been accepted; there really is no true differences between the groups; also called "alpha"
28
Type 2 Error
Accepting the null hypothesis when it is actually false, and you should NOT have accepted it; there really IS a true difference between the groups being compared; also called "beta"
29
Power
1-beta; statistical ability of a study to detect a true difference, IF one truly exists between group-comparisons, and therefore the level of accuracy in correctly accepting/not accepting the null hypothesis
30
Sample Size
The larger the sample size, the greater the likelihood (ability) of detecting a difference if one truly exists; also increases power
31
P Value
Statistical tests determine possible error-rate or likelihood of chance in comparing difference or relationship between variables
32
Confidence Interval (CI)
Most common selections are 90, 95, or 99%; calculated at an a priori percentage of confidence that statistically includes the real (yet unknown) difference or relationship being compared; based on variation in sample and sample size
33
Interpretation of a 95% CI
We are 95% confident that the "true" difference or relationship between the groups is contained within the confidence interval range
34
What does it mean when a CI crosses 1.0 for ratios or 0.0 for absolute differences?
Means that CI is NOT significant
35
Does "statistical" significance always confer meaningful, "clinical" significance?
No
36
Correlation (r)
Provides a quantitative measure of the strength and direction of a relationship between variables; values range from -1.0 - 1.0
37
Partial Correlation
A correlation that controls for confounding variables
38
What is the name of the nominal correlation test?
Contingency coefficient
39
What is the name of the ordinal correlation test?
Spearman correlation
40
What is the name of the interval correlation test?
Pearson correlation; for a pearson correlation, a p value of >0.05 means there is no linear correlation, but there MAY still be a non-linear correlation present
41
Survival Tests
Compares the proportion of events over time, or time-to events, between groups; commonly represented by a Kaplan-Meier Curve
42
What is the name of the nominal survival test?
Log-Rank Test
43
What is the name of the ordinal survival test?
Cox-Proportional Hazards Test
44
What is the name of the interval survival test?
Kaplan-Meier Test
45
Regressions
Provide a measure of the relationship between variables by allowing the prediction about the dependent, or outcome, variable (DV) knowing the value/category of independent variables (IVs); can also calculate OR for a measure of association
46
What is the name of the nominal regression test?
Logistic Regression
47
What is the name of the ordinal regression test?
Multinomial Logistic Regression
48
What is the name of the interval regression test?
Linear Regression
49
What are the 4 questions you should ask when selecting the correct statistical test?
1. What data level is being recorded? 2. What type of comparison/ assessment is desired? 3. How many groups are being compared? 4. Is the data independent or related/paired?
50
What is the name of the nominal test when comparing 2 groups of independent data?
Pearson's Chi-Square Test
51
What is the name of the nominal test when comparing 3 or more groups of independent data?
Chi-Square Test of Independence
52
What is the name of the nominal test when comparing 2 or more groups of independent data that have an expected cell count of less than 5?
Fisher's Exact Test
53
What is the name of the nominal post-hoc test?
Bonferroni Test of Inequality (Bonferroni Correction); adjusts p value for # of comparisons being made
54
What is the name of the nominal test when comparing 2 groups of related data?
McNemar Test
55
What is the name of the nominal test when comparing 3 or more groups of related data?
Cochran
56
What is the name of the ordinal test when comparing 2 groups of independent data?
Mann-Whitney Test
57
What is the name of the ordinal test when comparing 3 or more groups of independent data?
Kruskal-Wallis Test
58
What is the name of the ordinal test when comparing 2 groups of related data?
Wilcoxon Signed Rank Test
59
What is the name of the ordinal test when comparing 3 or more groups of related data?
Friedman Test
60
What are the names of the ordinal post-hoc tests?
Student-Newman-Keul, Dunnett, Dunn
61
Student-Newman-Keul Test
Compares all pairwise comparisons possible and all groups must be equal in size
62
Dunnett Test
Compares all pairwise comparisons against a single control and all groups must be equal in size
63
Dunn Test
Compares all pairwise comparisons possible and it is useful when all groups are not of equal size
64
What is the name of the interval test when comparing 2 groups of independent data?
Student T-Test
65
What is the name of the interval test when comparing 3 or more groups of independent data?
Analysis of Variance (ANOVA)
66
What is the name of the interval test when comparing 3 or more groups of independent data with confounders?
Analysis of Co-Variance (ANCOVA); compares the means of all groups against a dependent variable while also controlling for the co-variance of confounders
67
What is the name of the interval test when comparing 2 groups of related data?
Paired T-Test
68
What is the name of the interval test when comparing 3 or more groups of related data?
Repeated Measures ANOVA with 1 dependent variable
69
What is the name of the interval test when comparing 3 or more groups of related data with confounders?
Repeated Measures ANCOVA; compares the means of all groups against a dependent variable while also controlling for the co-variance of confounders
70
What are the names of the interval post-hoc tests?
Student-Newman-Keul, Dunnett, Dunn, Tukey or Scheffe, and Bonferroni Correction
71
Tukey/Scheffe Tests
Compares all pairwise comparisons possible and all groups must be equal in size; tukey test is slightly more conservative than the SNK; scheffe test is less affected by violations in normality and homogeneity of variances - most conservative
72
Kappa Statistic
Correlation test showing relationship or agreement between evaluators (consistency of "decisions" or "determinations")
73
Interpreting a Kappa Statistic
+1 - observers perfectly "classify" everyone exactly the same way 0 - there is no relationship at all between the observers "classifications", above the agreement that would be expected by chance -1 - observers "classify" everyone exactly the opposite of each other