Statistics I & II Flashcards

1
Q

Parameter

A

describes the whole population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Is a parameter fixed or does it vary? Why?

A

fixed because you know data about everyone in the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Statistic

A

describes a sample of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Is a statistic fixed or does it vary? Why?

A

varies because you can take different samples from the same population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Is a parameter or a statistic more reliable?

A

parameter

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the issue with using a parameter instead of a statistic?

A

if the population you are testing is large, the testing requires more time and resources

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are descriptive statistics used to describe?

A

central tendency, variability, & shape of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Central tendency

A

where the middle of the data lies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the “king” measurement for central tendency - mean, median or mode? Why?

A

mean because you can do the most stats w/it

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the mean affected by?

A

Outliers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

If your data has outliers, which measurement is used instead of mean?

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Median

A

the middle most number in a set of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Mode

A

the most frequently occurring score(s) in a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Mean (average)

A

the sum of the scores divided by the number of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Variability

A

the spread or dispersion of a set of research data or distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

If you have test scores that are all over the place (10’s, 35’s, and 100’s), yet most scored in the 80’s, would you have a lower or higher variability?

A

higher variability

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Range

A

the difference between the highest and lowest scores in a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Percentiles

A

describes a score’s position within a distribution compared to others on a scale of 0-100%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

If you are in the 75th percentile, you scored ___________ than ________ of others.

A

75%
higher

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Quartiles

A

divide distributions into four equal parts

Q1 = 25th percentile
Q2 = 50th percentile
Q3 = 75th percentile
Q4 = 100th percentile

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Interquartile range

A

The difference between the upper and lower quartiles.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

Standard deviation

A

represents the spread/dispersion/variability of scores relative to the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Coefficient of variation compares what?

A

compares 2 SD’s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the “king” measurement for variability?

A

standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Which type of measurement is used to describe nominal data?
mode
26
Which type of measurement is used to describe ordinal data?
median & interquartile range or mode
27
Which type of measurement is used to describe interval and ratio data?
mean & standard deviation or mode & interquartile range
28
What is a bell curve used to describe?
shape
29
How much data lies between -1SD and 1SD of the mean?
68.3
30
How much data lies between -2SD and 2SD of the mean?
95.5%
31
How much data lies between -3SD and 3SD of the mean?
99.7%
32
In a bell curve with a positive skewed distribution, the mean gets pulled to the ___________ creating a tail to the __________. Why?
right, right the mean is heavily affected by outliers, so it gets pulled further toward the right
33
In a bell curve with a negative skewed distribution, the mean gets pulled to the __________ creating a tail to the ___________. Why?
left, left the mean is heavily affected by outliers, so it gets pulled further toward the left
34
When the data is skewed in a bell curve, the __________ is often used to describe the central tendency. Why?
median because it is not as impacted by outliers as the mean is
35
Inferential statistics
allow us to estimate unknown population traits from using a sample When you have collected data from a sample, you can use inferential statistics to understand the larger population from which the sample is taken.
36
Probability
likelihood that an event will occur given all possible outcomes it is what SHOULD happen, not what WILL happen
37
What is probability used for?
used to determine if observed effects are likely to have occurred by chance
38
Probability is described as _______ and is in ________ form.
"p", decimal
39
Simply put, what is sampling error accounting for?
our sample will never fully reflect the entire population
40
Will a larger or smaller sample be more representative of the entire population?
larger
41
Null hypothesis
the observed difference (or effect) occurred by chance
42
Example of a null hypothesis
There will be NO STATISTICAL DIFFERENCES in strength between open and closed chain exercises.
43
Statistical tests will always test the ________ hypothesis.
null
44
Example of an alternative hypothesis
There will be SIGNIFICANT DIFFERENCES in strength between open and closed chain exercises.
45
Alternative hypothesis
the observed difference (or effect) could not occur by chance
46
When we run a statistical test, we either ____________ the null hypothesis or do not _________ the null hypothesis.
reject
47
Type I error
false-positive findings which means that the research found a difference when there isn't really a difference
48
Type II error
false-negative findings which means that research did not find a difference when there is a difference present
49
In a small sample size, which type of error is more likely?
type II
50
In a large sample size, which type of error is more likely?
type I
51
You decide to get covid tested and your results came back positive. However, it turns out that you don't actually have covid. What type of error is this?
type 1
52
You decide to get covid tested and your results came back negative. However, it turns out that you do have covid. What type of error is this?
type II
53
Level of significance (alpha)
determines how strict one is with rejecting the null hypothesis maximal acceptable risk of making a type I error Probability of a Type 1 error Rejecting Null hypothesis when its true
54
How would an individual reduce the chance of a type I error occurring?
reduce the level of significance (alpha)
55
Beta
probability of making a type II error
56
Statistical power (1-beta)
probability of finding statistically significant differences
57
What does the statistical power depend on?
alpha, sample size, effect size
58
How would an individual reduce the change of a type II error occurring?
lower beta or increase statistical power
59
What does the p-value tell you?
probability that your results are due to chance when testing between and/or within groups How likely ur data could have occurred under the null hypothesis.
60
If p = 0.10 this means that
there is a 10% probability that your results are due to chance.
61
If p < 0.05 this means that
there is less than a 5% probability that your results are due to chance.
62
If p < 0.01 this means that
there is less than a 1% probability that your results are due to chance.
63
The lower the p-value, the more confident you are that your results are _______ due to chance.
NOT
64
If you have a p-value = 0.10, does this mean that there is a 90% chance that your results are accurate?
no - there is 10% probability that your results are due to chance BUT you cannot reverse this & assume the opposite is true
65
What is the p-value dependent upon?
sample size
66
What are alternative forms of measurement to a p-value?
confidence intervals & effect size
67
Confidence Intervals (CI)
range of numbers for which you expect the true population to fall
68
If you have a 95% CI = (a, b), what does this mean?
you are 95% confident that the true difference between groups is between a and b
69
"Is there a difference?"
p-value
70
"How big is the difference?"
effect size
71
Larger effect size =
larger difference
72
What is the first step when choosing a statistical test?
determine the purpose of analysis what are you trying to do...? - predict - relationship - significance of difference
73
What is the second step when choosing a statistical test?
determine scale of dependent variable is it...? nominal, ordinal, interval or ratio
74
What are the 3 assumptions of parametric data?
1. normality: data follows normal distribution 2. equal variances: same spread 3. typically ratio or interval data & represented as mean
75
What are the 3 assumptions of nonparametric data?
1. no need for normality or equal variance 2. ordinal or nominal data 3. typically presented as median
76
When you see the word predict, which test should we automatically think of?
regression
77
What is R^2?
measure of how strong the prediction is. total variance of y (dependent) that can be explained by x (independent)
78
What are the 3 predicting factors?
1. likelihood ratios (LR) 2. relative risk (RR) 3. odds ratio (OR)
79
Sensitivity test
how often a test gives a positive result in people with a condition - lots of true positives - few false negatives
80
Specificity test
how often a test gives a negative result in people without a condition - lots of true negatives - few false positives
81
If a test has 92% sensitivity and you obtain a negative result, you can confidently rule ______ the condition.
out
82
If a test has a 95% specificity and you obtain a positive result, you can confidently rule ______ the condition.
in
83
What two clinical questions do likelihood ratios help answer?
1. How sure am I that a person w/a positive test result actually has what I tested him/her for? 2. How sure am I that a person w/a negative test result actually does not have what I tested him/her for?
84
+ LR =
sensitivity / (1-specificity)
85
- LR =
(1-sensitivity)/specificity
86
Relative Risk
compares one risk to another
87
Relative risk = 1
the risk in the exposed group is the same as the risk in the unexposed group, so no indication of benefit or harm
88
Relative risk < 1
the exposure is associated w/a protective effect
89
Relative risk > 1
exposed group has a greater risk of contracting the disease, so exposure is associated w/harm
90
If you have a RR of 0.5, what does this mean?
you are at a 50% lower risk than the average
91
If you have a RR of 1.5, what does this mean?
you have a 50% higher risk than the average
92
In order to put RR into context, what do you need to know?
the baseline risk for the disease
93
Odds Ratio
compares whether the odds of a certain event happening is the same for two groups
94
OR = 1
the event is equally likely in both groups
95
OR > 1
event is less likely in the first group
96
OR < 1
event is less likely in the first group
97
What is the measure of choice in a case-control study?
odds ratio