Chapter 6 Flashcards
(15 cards)
Law of large numbers
As the sample size N increases, the sample mean (xbar) gets closer to the population mean (u)
-bigger= More accurate average
more flips = closer to true probability
Gilvenko Cantelli theorem
As the sample size n increases, the sample distribution gets closer to the population distribution
Ex: imagine drawing 1000 stick figures-at first they look messy but as you keep drawing, they get closer to looking like a real person (the true population shape)
Central limit theorem
As sample size N increases the sampling distribution gets closer to a normal distribution
we say N>30 is large enough to use unless the population is extremely skewed
Linear combination of normal random variables
If the population is normally distributed, then the sampling distribution is normally distributed, no matter what n is
Sample mean (x bar)
It is a statistic computed from a sample and used to estimate the population mean
Sampling distribution
The probability of distribution of the sample across all possible samples of the population
Meaning of the sampling distribution
-expected value of x bar is equal to the population mean
Standard error
-the standard deviation of the sampling distribution
Variability of x bar
Sample mean, varies from sample to sample, but we can quantify this variability and make inferences about the population mean
Population distribution
-refers to the actual distribution of the actual values in the population. Pattern or shape of the entire data set that you’re drawing samples from. [Normal distribution, skewed distribution]
Sample distribution
-distribution of the values within one specific sample drawn from the population, a single sample might not fully represent the population, especially if the sample is small
Sampling distribution
-refers to the distribution of all possible sample means that could be obtained from samples of size N drawn from the population
Ex: if you took many different samples (1000 samples) each of size 10, and calculated the meaning for each sample, the sampling distribution would describe the distribution of those 1000 sample means
Ex; imagine you’re studying the weight of apples from a farm. The population distribution of Apple weights could be skewed, but there are a very few large apples that pull them in higher. If you take a small sample of apples, the sample distribution might not reflect the true pattern of Apple weights accurately. However, if you were to take many random samples of a larger size (like n = 100) the sampling distribution of the sample mean would tend to be more normal, even though the population distribution is skewed, the sampling distribution would still tend towards a normal shape as sample size increases.
Summary
-if the population is normal, you do not need a large sample to approximate the population mean with the sample mean the sampling distribution of the sample mean will be normal regardless of the sample size
-If the population is not normal, larger samples are needed for the sampling distribution of the sample meaning to approximate a normal distribution. This is why larger samples are essential when the population distribution is skewed or non-normal.
-the sampling distribution of the sample mean gets closer to a normal distribution as the sample size increases, even if the population distribution is not normal.