What type of sample ensures sound inferences?
- Representative of the population
How to avoid biased results
Characteristics of the normal distribution
z-value
Normalized value.
z = (x - mu)/sd
Central Limit Theorem
if we take enough sufficiently large samples from any population, the means of those samples will be normally distributed, regardless of the shape of the underlying population.
Distribution of Sample Means
Confidence Interval
Range around the sample mean to estimate the range in which the true population mean likely lies.
Size of sample
Large samples => greater than or equal to thirty.
Small sample => t-distribution - shorter, wider than normal distribution.
Large => normal distribution
Confidence intervals for proportions
Need to create a dummy (0, 1) variable.
And then proceed as we would with any other variable.
Ensuring sample size is large enough for estimating population proportion
n.p >= 5, and
n(1-pbar) >=5
[EXCEL] Generating random number between 0 and 999
=RAND()
[EXCEL] Cumulative probability for normal distribution
=NORM.DIST(x, mean, sd, cumulative)
cumulative = TRUE
[EXCEL] Cumulative probability for standardized normal distribution
=NORM.S.DIST(z, cumulative)
cumulative = TRUE
[EXCEL] Corresponding x-value for probability on a specified normal distribution
=NORM.INV(probability, mean, sd)
[EXCEL] Margin of error for normal distribution
=CONFIDENCE.NORM(alpha, sd, sample_size)
For sample size greater than or equal to 30
[EXCEL] Margin of error for t distribution
=CONFIDENCE.T(alpha, sd, sample_size)
For sample size less than 30
[EXCEL] Creating conditional subset
=IF(logical_test, [value_if_true], [value_if_false])