Lecture 5 - Biostatistics Part 1-2 Flashcards

1
Q

What are the Measures of Central Tendency?

A

Mean - the “average” – sum of the set divided by the number in the set

Median – the middle point (arrange the data smallest to largest, then find the middle point)

Mode – the score that occurs most frequently in a set of data
—-May have two most common values = “bimodal distribution”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

the “average” – sum of the set divided by the number in the set

A

Mean -

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

– the middle point (arrange the data smallest to largest, then find the middle point)

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

– the score that occurs most frequently in a set of data

A

Mode

—-May have two most common values = “bimodal distribution”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mode

—-May have two most common values =

A

“bimodal distribution”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q
  • The point/score at which 50% of scores fall below it and 50% fall above it
A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

This is the most general and least precise measure of central tendency

When two values occur the same number of times – Bimodal distribution

A

Mode - most frequently occurring value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

look at mean median mode images on slide 10

A

!

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Standard deviation

A

a measure of variation of scores about the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Variance compared to…

A

standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

quantifies the amount of variability, or spread, around the mean of the measurements.

To calculate: take each difference from the mean, square it, and then average the result

A

Variance (σ2 )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

To calculate the variance:

A

take each difference from the mean, square it, and then average the result

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

: a measure of variation of scores about the mean

A

Standard deviation (σ)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

To calculate the standard deviation:

A

take the √ of the variance
(the “average distance” to the mean)

In practice, the standard deviation is used more frequently than the variance.

Primarily because the standard deviation has the same units as the measurements of the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

When comparing two groups, the group with the larger standard deviation exhibits a greater amount of _____ while the groups with smaller deviation has less _______.

A

variability (heterogeneous)

variability (homogeneous)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Empirical rule for data (68-95-99) - only applies to a set of data having a distribution that is approximately bell-shaped:

A

Approximately 68% of all scores fall with 1 standard deviation of the mean

Approximately 95% of all scores fall with 2 standard deviations of the mean

Approximately 99.7% of all scores fall with 3 standard deviations of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Scatterplots:

A useful summary of a set of ________-

A

bivariate data (two continuous variables)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Scatterplots:

Gives a good visual picture of the relationship between the two variables, and aids the interpretation of the _____

A

correlation coefficient or regression model.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

the statistic that summarizes the relationship between the variable on the x axis and the variable on the y axis

A

correlation coefficient

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Perfect Positive correlation

A

X increases and Y increases at the same rate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

Perfect Negative correlation

A

X increases and Y decreases yet at the same rate

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

X increases and Y increases =

A

positive correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

X increases and Y decreases =

A

negative correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

0 value for correlation coefficient means there is

A

no correlation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
Represented by “r ” (rho) The absolute value of the coefficient (its size, not its sign) tells you how strong the relationship is between the variables.
Correlation Coefficient
26
Tells us how strongly two variables are related
Correlation Coefficient
27
Correlation Coefficient “r” can not be > 1 or < -1 Closer to -1 or +1: ? Closer to 0 : ?
Closer to -1 or +1: the stronger the relationship Closer to 0 : the weaker the relationship
28
The most common measure of association. Results can misleading if the relationship is non-linear.
Pearson Correlation
29
______ correlation is very sensitive to outlying values.
Pearson’s
30
______ non-parametric version of Pearson’s correlation. The calculation is based on the ranks of the data points of the x and y values.
Spearman Correlation:
31
The statement that establishes a relationship between variables being assessed Example: In a clinical trial the hypothesis states the new drug is better the placebo
Alternative hypothesis (Ha or H1)
32
The statement of no difference or no relationship between the variables Example: In a clinical drug trial the null hypothesis states that the new drug is no better than placebo
Null hypothesis (Ho)
33
AKA the “research hypothesis”
Alternative hypothesis
34
Hypothesis stating the expected relationship between independent and dependent variables.
Alternative hypothesis
35
If there IS a statistically significant difference, then the researchers “ACCEPT” the alternative hypothesis and “REJECT” the null hypothesis.
Alternative hypothesis
36
If there IS a statistically significant difference, then the researchers must “REJECT” the Null hypothesis
Null hypothesis
37
If there IS NOT a statically significant difference, then the researchers must “RETAIN” or “FAIL to REJECT” the Null hypothesis.
Null hypothesis
38
Two kinds of errors can be made when we conduct a test of hypothesis.
This first is called a Type I error; also known as a rejection error or an α error. The second kind of error that can be made during a hypothesis test is a Type II error, also known as an acceptance error or an β error.
39
A _____ error is made if we reject the null hypothesis when null hypothesis is true.
type I
40
The probability of make a type I error is determined by the ______ of the test.
significance level
41
The second kind of error that can be made during a hypothesis test is a Type II error, also known as an _____ error or an _____ error.
acceptance error or β error
42
A Type 2 error is made if we....
...fail to reject null hypothesis.
43
The probability of committing a type II error is represented by...
the Greek letter β.
44
The probability of finding an effect The probability of correctly rejecting the null hypothesis The probability of seeing a true effect if one exists Designers of studies typically aim for a power of 80% or 0.8 Implies there is an 80% chance of getting it right Generally speaking: More people = more power
Statistical power
45
The probability of correctly rejecting the null hypothesis
Statistical power
46
The probability of seeing a true effect if one exists
Statistical power
47
Designers of studies typically aim for a power of 80% or 0.8 Implies there is an 80% chance of getting it right
Statistical power
48
______ calculates the number of participants a study must have to draw accurate conclusions Takes into consideration: estimated effect size, sample means, etc.
power analysis
49
what does a power analysis take into consideration?
Takes into consideration: estimated effect size, sample means, etc.
50
is the probability of avoiding a type II error.
Power
51
Power may also be thought of as the likelihood that a particular study will detect a _____________ given that one exists.
deviation from the null hypothesis
52
If the β is the probability of making a type II error, 1- β is called the....
power of the test of hypothesis.
53
1- β is called....
the power of the test of hypothesis.
54
Statistical significance and p-value How do we determine if the study result happened by chance alone? What determines statistical “significance”?
Significance The probability of rejecting a true H0 α = .05 usually set, acceptable error -----Chance that 5 times out of 100 the H0 would be falsely rejected
55
Statistical significance and p-value
Probability level The likelihood that the difference observed between two interventions could have arisen by chance Accepted value is 5% risk (p = .05) Means there is a 5% chance that the results happened by chance Allows us to reject or accept the null hypothesis
56
Statistical significance and p-value p is... α is the acceptable error, usually = .05 p ≤ α _____ the H0 (null hypothesis) the results... p > α _____ the H0 (null hypothesis) the results....
the chance of random error α is the acceptable error, usually = .05 p ≤ α reject the H0 (null hypothesis) the results are statistically significant p > α fail reject the H0 (null hypothesis) the results not statistically significant
57
Less than 5% chance the effect happened by chance. “There is less than a 5% chance that the null hypothesis is falsely rejected” Often reported as “significant at the 0.05 level.”
A statistically significant p = or < .05
58
If the number of subjects is small, p value tells us that the effect was either large or consistent (or both) If the number of subjects is large, the effect size may not be that large
A highly significant p < .001
59
If the number of subjects is small, there might not have been enough subjects to find a difference that truly does exist If the number of subjects is large, we can be confident that either there is no difference between treatments, or the treatment effect is not consistent
An insignificant p > .05
60
Statistical significance and p-value Depends on several factors... As all of these factors increase, the likelihood of finding statistical significance increases
How large the effect was How consistent the effect was How many patients were studied ---As all of these factors increase, the likelihood of finding statistical significance increases
61
What's the difference between…. | P value of .051 and .049
Look at the slides if you don't know for sure.... slide 28 in particular