Chapter 7: Sampling Distributions Flashcards
Define Parameter
a number that describes some characteristic of the population
Define Statistic
a number that describes some characteristic of a sample. the value of a statistic can be computed directly from the sample data
symbols for
population mean
sample mean
proportion population
proportion sample
mu
x bar
p
p hat
Define Sampling Variability
the value of a statistic varies in repeated random sampling
Define Sampling Distribution of a statistic
the distribution of values taken by the statistic in all possible samples of the same size from the same population
Define Variability of a Statistic
described by the spread of its sampling distribution
This spread is determined primarily by the size of the random sample.
larger samples give smaller spread.
Define Bias
a statistic used to estimate a parameter is an unbiased estimator if the mean of its sampling distribution is equal to the true value of the parameter being estimated
bias = our aim is off and we consistently miss the bull’s-eye in the same direction. out sample values do not center on the population value.
Sampling Distribution of a Sample Proportion
mean: p hat = p
standard deviation: sigma (p hat) = sqrt ((p(1-p))/n) [measures the typical distance between a sample proportion and the population proportion p in samples of size n.
conditions:
1. 10% n<1/10N [allows the use of sigma (p)]
2. large count: np≥10 and n(1-p) ≥ 10 [allows to assume that sampling distribution is approximately normal]
Describe the change in sampling distribution once sample size decreases
spread would increase since smaller sample size = more variability
standard deviation would increase
large count might fail –> not approximately normal
finding the smallest sample size [Sampling Distribution of a Sample Proportion]
np≥10 and n(1-p)≥10
Sampling Distribution of a Difference Between Two Proportions Mean
mean: p hat1-p hat2 = p1 - p2
standard deviation: sigma (p hat1 - p hat2) = sqrt (p1(1-p1)/n1 + p2(1-p2))/n2) [measures the typical distance between a sample proportion and the population proportion p in samples of size n.
conditions:
1. 10% n1<1/10N1 n2<1/10N2 [allows the use of sigma (p)]
2. large count: n1p1≥10 and n1(1-p1) ≥ 10 and n2p2≥10 and n2(1-p2) ≥ 10[allows to assume that sampling distribution is approximately normal]
Central Limit Theorem
when n is sufficiently large, the sampling distribution of the sample mean [x bar] is approximately normal.
Sampling Distribution of a Difference Between Two Means
mean: mu hat1-p hat2 = mu 1 - mu 2
standard deviation: sigma (x bar1 - x bar2) = sqrt (sigma 1 ^2 / n1 + sigma 2 ^2 / n2) [measures the typical distance between a sample proportion and the population proportion p in samples of size n.
conditions:
1. 10% n1<1/10N1 n2<1/10N2 [allows the use of sigma (p)]
2. large count:
- both populations are normal so the sampling distribution of x bar1 - x bar2 is normal
- both samples exceeds the size of 30 or one population is normal and the other has sample size greater than 30.