AP Statistics Exam Review Flashcards
What graphs are appropriate for quantitative data?
dotplot, histogram, stemplot, boxplot
What graphs are appropriate for categorical data?
bar graph, pie graph (not in AP curriculum), 2-way table (I know–it’s not a graph)
When creating a graph by hand, always include these:
- labels for both axes.
2. numerical scales, with equal intervals labeled, on BOTH axes.
What is the difference between marginal and conditional distributions?
Marginal distributions are made from numbers in the MARGINS. Conditional distributions are from single rows or columns that are NOT in the margins.
When describing (or comparing) distributions, ALWAYS address:
shape, center, spread and outliers (SOCS)
What can usually be determined from a boxplot?
range, IQR, quartiles, MEDIAN
What can NOT be determined from a boxplot?
shape (at least not a complete description) and sample size
When writing COMPARISON statements, always be sure to
use COMPARATIVE language (“larger than…”, “both have…”, “more skewed than…,” “neither shows…,” etc.)
Stem plots require this for full credit:
A key. (Example: 4|3 = 43).
Features of a histogram:
equal bar (bin) widths, x-axis is a continuous number line, different bin widths may show different features of a distribution, Xscl in TI Window will change bin widths
When to use Mean vs. Median?
Generally use means with non-skewed data. Use medians with skewed data or data with outliers.
When is the mean higher than the median?
Generally, this happens when the data is skewed right, or has high outliers.
How can you estimate mean and median from a distribution?
The mean is the “balance point” if the distribution was made out of a solid material. The median is the “equal areas” location in a distribution.
What is standard deviation?
It is the “typical” (or average) deviation from the mean in a dataset.
When should IQR be used as a measure of spread instead of standard deviation?
IQR should be used when the data is skewed or has outliers. Standard deviation should be used when the data is roughly symmetric with no outliers.
What is the rule for determining outliers?
An outlier is more than 1.5 IQR’s away from the nearest quartile.
What is the percentile of x?
The percentage of the data that is less than x in a distribution (“less than or equal” is also acceptable).
What is “frequency” vs. “relative frequency?”
Frequency is counts (whole numbers). Relative frequency is percentage.
What is a standardized score?
The number of standard deviations from the mean.
How do you calculate a z-score?
z = (x – mean) ÷ (standard deviation)
What statistics/measurements change when you multiply a dataset by a constant?
All statistics/measurements change by this same factor (or divisor).
What statistics/measurements change by adding (or subtracting) a constant to all data?
Only measures of location change. Measures of spread are not affected.
What does a density curve show?
Overall patterns of a distribution are depicted. Also, the area under the curve is 1 (100%), so percentiles can also be depicted.
N(34, 4.2) means
a normal distribution with mean 34 and standard deviation 4.2