Unit 3 Flashcards

(128 cards)

1
Q

What are the 3 attributes of study variables?

A

Order / Magnitude

Consistency of scale / equal distances

Rational absolute zero

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 3 levels of data measurement?

A

Nominal
Ordinal
Interval/ Ratio

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Nominal

A

Dichotomous/ Binary

Non-ranked (non-ordered)

Named categories

 - categorical data
 - can be more than 2 categories 
 - ex: 2 genders, 2 age groups 

No order/ magnitude

No consistency of scale or equal distances

Nominal variables are simply labeled- variables without quantitative characteristics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Examples of Nominal variables

A

What is your gender?
- Male or Female

What is your hair color?
- brown vs black vs blonde vs grey vs other

Education level (if made binary)

Smoking vs non-smoking

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Ordinal

A

Ordered and order-able

Rank-able categories

Non-equal distance between ranges
- technically can be equal and unequal

Unitless

Yes order/ magnitude

No consistency of scale or equal distances

 - No units or scales 
 - No even spacing between them 

Data is collected in categories and can be ordered

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Examples of Ordinal variables

A

Pain Scales
- patient decides what each value means

Strongly agree > somewhat agree > Neither > somewhat disagree > strongly disagree

SES
- unitless, broken into categories

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Interval/ Ratio

A

Order/ magnitude

Equal Distances

  • unitless
  • equal spaces between scales

Interval

 - Arbitrary 0 value 
 - 0 doesn't mean absence 
 - Can be 0 or negative values

Ratio

- Absolute 0 value 
- 0 means absence of measurement value 
- No negative values 
- Ex: physiological parameters 
      - blood pressure
      - blood sugar
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Examples of Interval/ Ratio variables

A

Living siblings and personal age

Height in cm

Speed in m/s

LDL in mg/ dL

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Mean

A

Average value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Median

A

Middle value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Mode

A

Most common value

This is the most useful measurement for descriptive statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is descriptive statistics?

A

Tells us about our population

Describes our population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Range

A

Maximum - minimum

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Interquartile Range

A

Top 25% = Q3
Bottom 25% = Q1
Middle 50% = Q3 - Q1
- represented the 25% above and below the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Variance

A

The average of the squared differences in each individual measurement value and the groups mean

Describes the spread of data

Variance from the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Standard Deviation

A

Square root of variance

Restores units of mean

Describes spread of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Normal Distribution

A

Symmetrical

Mean and median are (almost) equal

Equal dispersion of curve (tails) to both sides of mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Statistical tests useful for normal- distributed data are known as _____

A

Parametric Tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Required assumptions of interval/ ratio data for proper selection of parametric tests

A
  1. Normal distribution
  2. Equal variances
    • use Levene’s Test
  3. Randomly derived and independent
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Levene’s Test

A

Test used to calculate if data is normally distributed and has equal variance

Used to assess if the variances are different between groups

Null Hypothesis: groups are equal

Tries to show that there is a difference between groups

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How to handle interval data that is not normally distributed

A
  1. Use a statistical test that does not require the data to be normally distributed
    • non-parametric tests
    • step down and run ordinal test
  2. Transform data to a standardized value
    • hope that the transformation allows data to be normally distributed
    • z score or log transformation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Positively Skewed

A

Asymmetric distribution with one tail longer than the other

Mean > median
- mean is higher than median

Tail points to the right

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Negatively Skewed

A

Asymmetric distribution with one tail longer than the other

Mean < median
- mean is lower than medium

Tail points to the left

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What effect do outliers have on skewness?

A

Outliers pull the tails out farther

Contributes to skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Skewness
A measure of the asymmetry of a distribution Perfectly normal distribution is symmetric - skewness = 0 Negative skewness = negatively skewed data Positive skewness = positively skewed data
26
Kurtosis
A measure of the extent to which observations cluster around the mean - how peaked a value is Kurtosis of normal distribution curve = 0 Positive kurtosis = more cluster (around a number) - closer to a positive value Negative kurtosis = less cluster (around a number) - closer to a negative value
27
68%
1 standard deviation away from the mean
28
95%
2 standard deviations away from the mean
29
99.7%
3 standard deviations away from the mean
30
Null Hypothesis
Research perspective which states there will be no true difference between the groups being compared Most conservative and most commonly utilized At end of study, either need to accept or reject the null Can take on the superiority, noninferiority, and equivalency perspectives
31
Alternative Hypothesis
Research perspective which states there will be a true difference between the groups being compared
32
Type 1 error
Alpha Not accepting the null hypothesis when it is actually true and you should have accepted it Rejecting the null hypothesis when you shouldn't have Ex: Telling a man he is pregnant
33
Type 2 error
Beta Accepting the null hypothesis when it is actually false and you should not have accepted it Not rejecting the null hypothesis when you should have Ex: Telling a pregnant woman she is not pregnant
34
P value
Probability value (alpha) Based on the probability, due to chance alone, a test statistic value as extreme or more extreme than actually observed if groups were similar (not different) Represents your chances of being wrong If p < 0.05, risk of experiencing a type 1 error is acceptably low
35
T/F: If p value is lower than the pre-selected alpha (5% or 0.05), it is statistically significant
True
36
Do you accept or reject the null hypothesis if p value < alpha?
REJECT
37
What is the interpretation of a p value?
The probability of making a type 1 error if the null hypothesis is rejected
38
If the data is statistically significant and there are 3+ groups, the p value tells you what?
Tells you that there is at least 1 difference present Guaranteed difference between control and most extreme value Lowest and highest value represent the difference
39
At baseline, do we want groups to be equal?
Yes At start of study/ baseline, p value should be 1.0 Want p values to start above 0.05
40
Power
The statistical ability of a study to detect a true difference if only one truly exists between group comparisons and therefore the level of accuracy in correctly accepting/ not accepting the null hypothesis If there is truly a difference between groups, study has high power Studies are set up to have 80% power When lose people, lose power
41
What is the mathematical representation of power?
1 - beta = 1 - type 2 error rate We allow a type 2 error rate of 20% - accept 20% of risk of finding an error 1 represents sample size
42
The _____ people the study has, the _____ the power
More | Higher
43
What are the common elements utilized in determining sample size of a study?
Minimum difference between groups deemed significant - the smaller the difference between groups necessary to be considered significant, the greater the sample size needed Expected variation of measurement Alpha and beta error rates and confidence interval
44
How does sample size affect power?
The larger the sample size, the greater the likelihood of detecting a difference if one truly exists Increases power
45
What is the number one way you can ensure your study has power?
To show the difference between groups if a difference is really present
46
When is it okay for p values to be non-statistically significant?
Start of study Levene's Test
47
What is a caveat of p values?
They do not tell us about spread/ dispersion
48
Confidence Intervals
Precision measurement CI around a group's differences help reader understand where true value may lie Calculated at an a priori percentage of confidence that statistically includes the real (yet unknown) difference or relationship being compared
49
What advantage do confidence intervals have over p values?
Tells us about statistical significance and spread
50
What are confidence intervals based on?
Variation in sample Sample size
51
If you don't use the same directional word when interpreting CI, is the data statistically significant?
NO If CI values cross 1.0, data is not statistically significant
52
When the numbers that make up the CI are on the same side of 0, are the numbers statistically significant?
YES
53
Why is it okay to have a high p value for a Levene's Test?
We are okay with accepting the null because it means we are saying the variances are equal so we can thus use an interval test with our data.
54
The bigger the CI, the ____ confidence we have in the precision of our estimate. Why do we see this?
Less With 99% CI, it is harder to show significance - less likely to cause type 1 error - more confident that true value lies in interval
55
On Forest plots, the horizontal lines represent _____ and the boxes represent _____.
Confidence interval Values (our data point)
56
What is an important question to consider once you have determined your data is statistically significant?
Does statistical significance actually confer meaningful "clinical" significance?
57
What questions do we ask when selecting the correct statistical test?
1. What data level is being recording? - Does the data have order or magnitude - Does the data have an equal, consistent distance along the entire scale? 2. What type of comparison/ assessment is desired? - Frequencies/ counts/ proportions 3. How many groups are being compared? - 2 or 3 or more groups 4. Is the data independent or related (paired)? - data from the same groups = paired - data from different groups = independent
58
Correlation
Provides a quantitative measure of the strength and direction of a relationship between variables Tells you if there is a relationship or not - direction ( + or -) - magnitude (strong or weak)
59
Partial Correlation
A correlation that controls for confounding variables
60
What does a correlation of - 1.0 tell us and look like graphically?
Perfectly negative As 1 variable goes up, the other goes down - 45 degree slope
61
What does a correlation of + 1.0 tell us and look like graphically?
Perfectly positive As 1 variable goes up, the other goes up + 45 degree slope
62
Nominal Correlation Test
Contingency Coefficient
63
Ordinal Correlation Test
Spearman Correlation
64
Interval Correlation Test
Pearson Correlation p > 0.05 for a Pearson Correlation means there is no linear correlation but there could be non- linear correlations present
65
Survival Tests
Deal with event occurrence / time-to-event Compares the proportion of events over time or time-to-events between groups Predict changes in event over time Asks 'does frequency of events between groups differ over time?' - amounts can be different - rates can be different Used when we want to see changes over time
66
Survival refers to an event that has or has not occurred yet?
Has NOT occurred yet Examples: Death, hospitalization
67
What kind of curve represents survival tests and what are the features of it?
Kaplan- Meier curves If everyone starts at 100, means every patient is free of event If everyone starts at 0, means no one has had the event
68
Nominal Survival Test
Log- rank test
69
Ordinal Survival Test
Cox- proportional hazards test
70
Interval Survival Test
Kaplan- Meier Test Assesses time - time as a variable
71
Regression Test
Used to predict the likelihood of an outcome/ association Allows us to used multiple variables at one time to determine an outcome - predicting an outcome given a bunch of variables Provide a measure of the relationship between variables by allowing the prediction about the dependent (outcome) variable knowing the value/ category of independent variables Can be used to calculate the odds ratio for a measure of association
72
Nominal Regression Test
Logistic Regression
73
Ordinal Regression Test
Multinomial Logistic Regression
74
Interval Regression Test
Linear Regression
75
Pearson's Chi- square Test
Compares 2 groups of independent nominal data Compares group proportions and if they are different from those expected by chance Not powerful for small numbers - no expected cell count of less than 5 observations Asks "is the proportion of yes or no responses between groups any different?" Compares real numbers with what they expect to find based on a skewed Chi-square curve
76
Chi- square Test of Independence
Used for 3+ groups of independent nominal data Compares group proportions and if they are different from those expected by chance Not powerful for small numbers - no expected cell count of less than 5 observations Compares real numbers with what they expect to find based on a skewed Chi-square curve
77
Fisher's Exact Test
Used for 2+ groups with expected cell counts of less than 5 Nominal data test Powerful for small numbers Still Chi- square like test but handles the problem of small numbers
78
Bonferroni Test of Inequality
Used for 3+ groups of independent nominal data Used to find which groups are different in nominal data set Adjusts the p-value for the number of comparisons being made = p -value / number of comparisons you are making Very conservative Helps us reduce risk of type 1 error
79
If have statistically significant results in 3+ comparisons, you need to perform _______ ?
Subsequent analysis to determine which groups are different - subsequent analysis = post- hoc testing Cannot use multiple chi- square/ multiple statistical tests
80
Why can't you use multiple chi-square/ multiple statistical tests to determine which groups are different?
Risk of type 1 error increases with each additional test
81
McNemar Test
Used for 2 groups of paired/ related nominal data Just tells us that there is at least 1 difference between groups - doesn't tell us where but can use a Bonferroni test of inequality/ correction to find where differences are Same principles/ assumptions as Chi- square
82
Cochran Q Test
Used for 3+ groups of paired/ related nominal data Just tells us that there is at least 1 difference between groups - doesn't tell us where but can use a Bonferroni test of inequality/ correction to find where differences are Same principles/ assumptions as Chi- square
83
What are examples of key words for paired data?
Pre vs post Before vs after Baseline vs end Start vs end
84
Mann- Whitney
Used to compare the median values between 2 groups of independent ordinal data Also used for interval data not meeting parametric requirements
85
Kruskal- Wallis Test
Used to compare the median values between 3+ groups of independent ordinal data Also used for interval data not meeting parametric requirements If 3+ group comparison is significant, must perform a post- hoc test to determine where differences are
86
Wilcoxon Signed Rank
Used to compare median values between 2 groups of paired/ related data Also used for non-normally distributed interval data or data that don't meet all parametric requirements Neg sign = rank moved down Pos sign = rank moved up
87
Friedman Test
Used to compare the median values between 3+ groups of paired/ related data Also used for non-normally distributed interval data or data that don't meet all parametric requirements If 3+ group comparison significant, must perform a post- hoc test to determine where differences are
88
Student- Newman- Keul Test
Post- hoc test that is used for 3+ group comparisons of ordinal and interval data when they are statistically significant Compares all pair-wise comparisons possible All groups must be equal in size
89
Dunnett Test
Post- hoc test that is used for 3+ group comparisons of ordinal and interval data when they are statistically significant Compares all pair-wise comparisons possible against a single control All groups must be equal in size
90
Dunn Test
Post- hoc test that is used for 3+ group comparisons of ordinal and interval data when they are statistically significant Compares all pairwise comparisons possible Useful when all groups are not of equal size - Account for lost to follow ups
91
Tukey Test
Post- hoc test that is used for 3+ group comparisons of interval data when they are statistically significant Compares all pairwise comparisons possible All groups must be equal in size More conservative than Stident- Newman- Keul test
92
Scheffe Test
Post- hoc test that is used for 3+ group comparisons of interval data when they are statistically significant Compares all pairwise comparisons possible All groups must be equal in size Less affected by violations in normality and homogeneity of variances Most conservative
93
What are the Post- hoc tests that can be used when 3+ group comparisons of ordinal data are statistically significant?
Student- Newman- Keul Test Dunnett Test Dunn Test
94
What are the Post- hoc tests that can be used when 3+ group comparisons of interval data are statistically significant?
``` Student- Newman- Keul Test Dunnett Test Dunn Test Tukey Test Scheffe Test ```
95
Student t- Test
Used to compare the means of 2 groups of independent interval data
96
ANOVA (Analysis of Variance)
Used to compare the means of 3+ groups of independent interval data Also compares intra- and inter-group variations against a dependent variable If 3+ group comparisons are significant, must perform a post- hoc test to determine where the differences are
97
ANCOVA (Analysis of Co- Variance)
Used to compare the means of 3+ groups of independent interval data with confounders Compares intra- and inter- group variations of related data against a dependent variable while also controlling for the co-variance of confounders
98
Repeated Measures ANOVA
Used to compare the mean values of 3+ groups of paired/ related interval data Also compares intra- and inter-group variations of related data against a dependent variable If 3+ group comparison is significant, must perform a post-hoc test to determine where differences are
99
Repeated Measures of ANCOVA
Used to compare the mean of 3+ groups of paired/ related interval data with confounders
100
Kappa Statistic
A correlation test showing relationship or agreement between evaluators Consistency of decisions/ determinations
101
K = +1.0
Observers perfectly classify everyone exactly the same way Good agreement
102
K = 0.0
No relationship at all between the observers' classifications above the agreement that would be expected by chance
103
K = -1.0
Observers classify everyone exactly the opposite of each other Poor agreement
104
What is the National Clinical Trials number (NCT)?
A unique identifier number assigned by clinicaltrials.gov once research protocol is submitted prior to study initiation A number all studies are given once registered that allows you to track progress of study Allows us to know what’s going on and why its going on
105
What is the purpose of the NCT number?
Reduce publication bias | - not sharing results because they are bad/ don’t show anything
106
Clinicaltrials.gov is an ___ ___ ___ ___ ___ ___ which offers what kind of information?
International Committee of Medical Journal Editors Acceptable public registry that offers up to date information for locating interventional studies
107
What do readers need in order to accurately assess a study?
Complete, clear, and transparent information on the study's methodology and findings
108
What does a checklist help when reviewing medical literature?
Provides a stepwise, systematic review of the published medical literature
109
What type of studies is a CONSORT checklist used for?
Randomized interventional trials Interventional Studies
110
What does CONSORT stand for?
Consolidated Standards of Reporting Trials
111
CONSORT can be extended and used for _____?
Non-inferiority Trials Equivalence Trials Cluster Trials Pragmatic Trials
112
Define: Pragmatic Trials
Randomized, controlled trial whose purpose is to inform decisions about clinical practice Philosophy as a continuum not a dichotomy
113
What type of studies is a PRISMA checklist used for?
Systematic reviews of multiple randomized trials Interventional Studies
114
What does PRISMA stand for?
Preferred Reporting Items for Systematic Reviews and Meta-Analyses
115
What type of studies is STROBE used for?
Observational Studies | - cohort, case-control, cross-sectional
116
What does STROBE stand for?
Strengthening the Reporting of Observational Studies in Epidemiology
117
STROBE can be extended and used for _____?
Molecular Epidemiology Studies (STROBE-ME) Genetic Association Studies (STREGA) - Strengthing the reporting of genetic association studies
118
What type of studies is TREND used for?
Reporting evaluations with non-randomized designs of behavioral and public health interventions Non-randomized studies
119
What does TREND stand for?
Transparent Reporting of Evaluations with Non-randomized Designs
120
What type of studies is REMARK used for?
Tumor marker prognostic studies
121
What does REMARK stand for?
Reporting Recommendations for Tumor Marker Prognostic studies
122
What type of studies is GRIPS used for?
Genetic risk prediction studies
123
What does GRIPS stand for?
Genetic Risk Prediction Studies
124
What type of studies is STARD used for?
Diagnostic Studies Single diagnostic study
125
What does STARD stand for?
Standards for the Reporting of Diagnostic Accuracy Studies
126
What type of studies is QUADAS-2 used for?
Systematic reviews of multiple diagnostic studies Diagnostic Studies
127
What does QUADAS-2 stand for?
Quality Assessment of Studies of Diagnostic Accuracy in Systematic Reviews, 2nd edition
128
DOI number
Digital Object Identifier number Gives us location of study on internet Tells us where we can find print version of article Unique and specific to each article