AP Exams Flashcards
(32 cards)
Interpret a z-score
A z-score describes how many standard deviations a value or statistic (x, x, P, etc.) falls away from the mean of the distribution and in what direction.
The further the z-score is away from zero the more
“surprising” the value of the statistic is.
Describe the Distribution
OR
Compare the Distributions
SOCS!
Shape, Outliers, Center, Spread
Only discuss outliers if there are obviously outliers present. Be sure to address SCS in context!
If it says “Compare”
YOU MUST USE comparison phrases like “is greater than” or “is less than” for Center & Spread
Geometric Distribution
(Conditions)
- Binary? Trials can be classified as success/failure
- Independent? Trials must be independent.
- Trials? The goal is to count the number of trials until the first success occurs
- Success? The probability of success (p) must be the same for each trial.
Outlier Rule
Upper Bound = Q3 + 1.5(1QR)
Lower Bound = QI - 1.5(IQR)
IQR = Q3 - li
What is a Residual?
Residual = y - y
A residual measures the difference between the actual (observed) y-value in a scatterplot and the y-value that is predicted by the
LSRL using its corresponding x value.
* In the calculator: L3 = L2 - Y,(L1)
Inference for Proportions (Conditions)
Random: Data from a random samples) or randomized experiment
Normal: At least 10 successes and failures (in roth groups, for a two sample problem)
Independent: Independent observations and independent samples/groups; 10% condition if sampling without replacement
Type I Error, Type II Error,
& Power
- Type I Error: Rejecting Ho when Ho is actually true. (Ex. Convicting an innocent person
- Type II Error: Failing to (II) reject Ho when Ho should be rejected. (Ex. Letting a guilty person go free)
- Power: Probability of rejecting Ho when Ho should be rejected. (Rejecting Correctly)
Mean, and Standard
Deviation Of a
Binomial Random Variable
Also on the formula sheet!
Mean: μx = np
Standard Deviation:
σx = пp(I-p)
Inference for Regression (Conditions)
Linear: True relationship between the variables is linear.
Independent observations, 10% condition if sampling without replacement
Normal- Responses vary normally around the regres.
line for all x-values
Equat var ‘ance around the regression line for all x-values
Landom: Data from a random sample or randomized
experiment
Mean and Standard
Deviation of a
Discrete Random Variable
Also on the
formula
Mean (Expected Value):
sheetl
M. = [x,P.
(Multiply & add across the table)
Standard Deviation:
0, = VI(x, - M.) P.
Square root of the sum of (Each x value - the mean) (its probability)
Inference for Means (Conditions)
Random: Data from a random samples) or randomized experiment
Normal: Population distribution is normal or large samples) (n, ≥ 30 or n, ≥ 30 and n2 ≥ 30)
Independent: Independent observations and independent samples/groups; 10% condition if sampling without replacement
Interpreting
Expected Value/Mean
The mean/expected value of a random variable is the long-run average outcome of a random phenomenon carried out a very large number of times.
Binomial Distribution
(Conditions)
- Binary? Trials can be classified as success/failure
- Independent? Trials must be independent.
- Number? The number of trials (n) must be fixed in advance
4: Success? The probability of success (p) must be the same for each trial.
Interpret
LSRL y-intercept “a”
When the x variable (context) is zero, the y variable (context) is estimated to be put value here.
Using Normal cdf and
Invnorm
(Calculator Tips)
Normalcdf (min, max, mean, standard deviation)
Invnorm (area to the left as a decimal, mean, standard deviation)
Interpret r
Correlation measures the strength and direction of the linear relationship between x and y.
• r is always between -1 and 1.
• Close to zero = very weak,
• Close to 1 or -1 = stronger
• Exactly 1 or -1 = perfectly straight line
• Positive r = positive correlation
• Negative r = negative correlation
Interpret
LSRL Slope “b”
For every one unit change in the x variable (context)
the y variable (context)
is predicted to increase/decrease by [ ] units (context).
Chi-Square Tests
df and Expected Counts
- Goodness of Fit:
df = # of categories - 1
Expected Counts: Sample size times hypothesized proportion in each category. - Homogeneity or Association/Independence:
df = (# of rows - 1)(# of columns - 1)
Expected Counts: (row total)(column total)
table total
Interpret
Standard Deviation
Standard Deviation measures spread by giving the “typical” or “average” distance that the observations (context) are away from their (context) mean
Experimental Designs
- CRD (Completely Randomized Design) - All experimental units are allocated at random among all treatments
- RBD (Randomized Block Design) - Experimental units are put into homogeneous blocks.
The random assignment of the
units to the treatments is carried out separately within each block. - Matched Pairs - A form of blocking in which each subject receives both treatments in a random order or the subjects are matched in pairs as closely as possible and one subject in each pair receives each treatment, determined at random.
Complementary Events
Two mutually exclusive events whose union-is the sample space.
A
Ex: Rain/Not Rain,
Draw at least one heart / Draw NO hearts
Sampling Techniques
- SRS- Number the entire population, draw numbers from a hat (every set of n individuals has equal chance of selection)
- Stratified - Split the population into homogeneous groups, select an SRS from each group.
- Cluster
- Split the population into heterogeneous groups called clusters, and randomly select whole clusters for the sample. Ex. Choosing a carton of eggs actually chooses a cluster (group) of 12 eggs. - Census - An attempt to reach the entire population
- Convenience
Selects individuals easiest to reach - Voluntary Response -
- People choose themselves by
responding to a general appeal.
I am from
% confident that the interval captures the true
STATE: What parameter do you want to estimate, and
at what confidence level?
PLAN: Choose the appropriate inference method.
Check conditions.
DO: If the conditions are met, perform calculations.
CONCLUDE: Interpret your interval in the
context of the problem.
Standard Deviation measures spread by giving the “typical” or “average” distance that the observations (context) are away from their (context) mean - Is there a curved
Interpreting
a Residual Plot
- Is there a curved pattern? If so, a linear model may not be appropriate.
- Are the residuals small in size? If so, predictions using the linear model will be fairly precise.
- Is there increasing (or decreasing) spread? If so, predictions for larger (smaller) values of x will be more variable.
Explain a P-value
Assuming that the null is true (context) the P-value measures the chance of observing a statistic (or difference in statistics) (context)
as large as or larger than the one actually observed.