Flashcards in STATS Deck (21)
When the data are normally distributed and conform to a bell-shaped curve, what are percents that ligh between the mean +/- 1, 2, and 3 SD respectively
When the data are normally distributed and conform to a bell-shaped curve, 67%, 95%, and 99.7% of the observations lie between the mean +/- 1, 2, and 3 SD respectively (choice C).
The median is
known as the 50th percentile. It is the value that lies in the center of the data points when they are arranged in numerical order:
10, 11, 11, 11, 13, 13, 14, 14, 15, 15, 16, 17, 18, 22, 24 Bottom Line: The median is also known as the 50th percentile.
Qualitative variables are divided into
nominal and ordinal variables.
A nominal variable is
named category. An example is an individual’s favorite ice cream flavor (chocolate, vanilla, or rocky road).
In the clinical setting, one often sees this broken down into a yes or no answer, as in survival of the patient or the presence or absence of a complication (bionary is nominal)
When a nominal variable consists of only two categories (such as yes and no), it is also referred to as a dichotomous variable.
An ordinal variable
usually is seen in ranking scales,
such as in injury severity scores.
An example is how miserable an individual feels on a particular day, measured on a scale of 1 to 10;
Ordinal variables are tested with
The Wilcoxon signed rank test is the most common nonparametric test used primarily to examine the differences between two paired treatments using ordinal variables.
This test determines differences between treatments for nominal variables (ice cream flavor stats).
This test is used to compare two unpaired treatments using quantitative variables.
The Wilcoxon signed ranked test
This test is used to compare two paired treatments, using ordinal variables.
two paired (ranked) tests to compare number outcomes
the number of specified new events, e.g. number of people falling ill with a specified disease, during a specified period, in a specified population. people years
the number of cases of a disease existing in a given population at a specific period of time or at a particular moment in time.
Normally distributed data is
and thus conforms to a bell-shaped curve
It is symmetric about the mean and all three measures of central tendency: the mean, median, and mode are generally equal or very close in value
, such as Student’s t test, mandate several assumptions about the variables being considered, namely that they are normally distributed.
Parametric tests are generally considered to be more powerful than nonparametric tests (choice E) and measurements should be, at a minimum, on an interval scale.
Nonparametric data, on the other hand, is generally skewed and therefore does not fit a bell curve. Discrete data, such as that measured on a nominal or ordinal scale (choice C), consists of only a few possible values or categories and will not follow the normal distribution. It should therefore be analyzed using methods other than Student’s t test (choice D). Assessment of nonparametric data generally involves the use of contingency tables and significance testing uses methods such as the Mann-Whitney U (also known as Wilcoxon rank-sum) or Chi Square tests.
the proportion of patients who have a disease (D+) that will test positive for the disease (T+).
The higher the specificity of a test
the lower the likelihood of obtaining a false-positive result. Specificity, therefore, is also equal to 1 – the false-positive rate.
This makes it a good screening test because if it says you are negative you are negative
Positive predictive value
gives the percentage of patients with a positive test result who actually have the disease
low positive predictive value will have a high likelihood of producing false positives and will require confirmatory testing with a more reliable test.
calculated by dividing the number of true positives by the total number of positive test results.
Type II errors
when a statistically significant difference DOES exist between the population means and yet it is not detected
and The null hypothesis (H0) is inappropriately accepted.
The probability of committing a type II
error is estimated by
BETA (type 2 = b)
generally set at 0.20
(i.e., the researcher accepts a 20% chance that they will incorrectly accept the null hypothesis).
A common reason of committing a type II error is
the study sample is too small to detect the difference that actually exists.
Type I errors are committed by
This is the number one error seen in publications (because people don't publish non significant results..)
final conclusion states that a statistically significant difference exists between the population means when, in fact, one does not
and The null hypothesis H0 is incorrectly rejected.