Class 6 Spring Flashcards
What are the two types of data that can be analyzed after sampling from a larger population?
Categorical and numerical data
What are some ways to visualize data trends and distribution shape?
Table or graph (e.g., frequency plot, bar chart, histogram, box-and-whisker, scatterplot)
What summary statistics are commonly calculated?
- Central tendency (mean, median, mode)
- Dispersion (range, IQR, standard deviation)
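A minimal sketch of these summary statistics, using Python's standard `statistics` module. The sample values are hypothetical, made up only for illustration, and quartile conventions differ between tools, so the IQR may not match what other software reports for the same data.

```python
import statistics

# Hypothetical sample, invented purely for illustration.
sample = [2, 4, 4, 5, 7, 9, 11]

# Central tendency
mean = statistics.mean(sample)
median = statistics.median(sample)
mode = statistics.mode(sample)

# Dispersion
data_range = max(sample) - min(sample)
# quantiles(n=4) returns the three quartile cut points
# (Python's default "exclusive" method).
q1, _, q3 = statistics.quantiles(sample, n=4)
iqr = q3 - q1
sd = statistics.stdev(sample)  # sample SD (n - 1 denominator)

print(f"mean={mean}, median={median}, mode={mode}")
print(f"range={data_range}, IQR={iqr}, SD={sd:.3f}")
```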
What is the relationship between the type of table or graph and the nature of the variables?
It depends primarily on whether the variables are numerical or categorical
What shape does a normal (Gaussian) distribution always have?
Bell-shaped and symmetrical around a central mean
What does the area under the normal distribution curve always equal?
1
What does the 68-95-99.7 Rule (Empirical Rule) describe?
- About 68% falls within 1 SD of the mean
- About 95% falls within 2 SD of the mean
- About 99.7% falls within 3 SD of the mean
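The three percentages above can be checked numerically. The sketch below uses Python's `statistics.NormalDist`; the helper name `share_within` is an illustrative choice, not something from the course material.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def share_within(k):
    """Share of a normal distribution within k SDs of the mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    # Prints roughly 0.6827, 0.9545 and 0.9973.
    print(f"within {k} SD: {share_within(k):.4f}")
```

Because the result depends only on the number of standard deviations, the same shares hold for any normal distribution, whatever its mean and SD.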
What is the formula for calculating a z-score?
(observation - population mean) / standard deviation
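As a small illustration, the z-score formula can be written as a function. The function name and the example numbers are hypothetical, chosen only to show the arithmetic.

```python
def z_score(observation, mean, sd):
    """Number of SDs the observation lies above (+) or below (-) the mean."""
    if sd <= 0:
        raise ValueError("standard deviation must be positive")
    return (observation - mean) / sd

# Hypothetical values: an observation of 80 from a population
# with mean 70 and standard deviation 5.
print(z_score(80, mean=70, sd=5))  # 2.0
```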
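What is the notation for a normal distribution, and how is the standard normal distribution written?
N(µ = __, σ = __), with the mean and standard deviation filled in; the standard normal distribution is N(µ = 0, σ = 1)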
How can z-scores be used in relation to different distributions?
To compare how unusual two measurements are, even when looking at different normal distributions
What does a z-score represent?
The number of standard deviations an observation is above or below the mean
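True or False: Z-scores can be converted to percentiles with the normal curve for distributions of any shape.
False. A z-score can be computed for a distribution of any shape, but it converts to a percentile through the normal curve only when the distribution is normal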
How is the probability that a randomly sampled US adult is between 180 cm and 185 cm tall estimated?
By the shaded area under the curve between 180 cm and 185 cm
What statistical function in R is used to calculate the lower tail area based on a z-score?
pnorm()
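For readers working outside R, a rough Python counterpart is sketched below. It assumes that `statistics.NormalDist().cdf(z)` plays the same role as `pnorm(z)` with its default mean of 0 and SD of 1; the wrapper name `lower_tail` is hypothetical.

```python
from statistics import NormalDist

def lower_tail(z):
    """Area under the standard normal curve to the left of z."""
    return NormalDist().cdf(z)  # NormalDist() defaults to mean 0, SD 1

print(lower_tail(0))            # 0.5: half the area lies below the mean
print(round(lower_tail(1), 4))  # 0.8413
```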
Fill in the blank: The area under the normal curve always adds up to _______.
1
How do real-world data distributions compare with the normal distribution?
They never follow a perfect normal curve; the normal distribution is only an approximation
What does a z-score above 3 or below -3 signify?
A very unusual observation
What are the mean and standard deviation of SAT scores?
Mean = 1500, Standard Deviation = 300
What are the mean and standard deviation of ACT scores?
Mean = 21, Standard Deviation = 5
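The SAT and ACT parameters above allow scores on the two tests to be compared on a common scale. The sketch below uses two hypothetical scores (an SAT score of 1800 and an ACT score of 24) that do not appear in these cards.

```python
def z_score(observation, mean, sd):
    """Number of SDs the observation lies above (+) or below (-) the mean."""
    return (observation - mean) / sd

# Distribution parameters taken from the cards above.
SAT_MEAN, SAT_SD = 1500, 300
ACT_MEAN, ACT_SD = 21, 5

# Hypothetical scores, for illustration only.
sat_z = z_score(1800, SAT_MEAN, SAT_SD)
act_z = z_score(24, ACT_MEAN, ACT_SD)

print(f"SAT z-score: {sat_z}")  # 1.0
print(f"ACT z-score: {act_z}")  # 0.6
```

Under these assumptions the SAT score sits further above its mean in standard-deviation units, so it is the more unusual of the two results.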
What is the role of parameters in the normal distribution?
Parameters (mean µ and standard deviation σ) describe the normal distribution perfectly
What method is used to find the probability of a measurement falling above a particular cutoff value?
1 - pnorm(z), where z is the cutoff's z-score; equivalently, 1 - pnorm(value, mean, sd)
How is the probability that a randomly selected student scores at least 1190 on her SATs calculated?
1 - pnorm(z-score), where the z-score is (1190 - mean) / standard deviation
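A sketch of this upper-tail calculation in Python, assuming the SAT mean of 1500 and SD of 300 given on the earlier card. `NormalDist.cdf` stands in for R's `pnorm()` here.

```python
from statistics import NormalDist

sat = NormalDist(mu=1500, sigma=300)  # parameters from the SAT card

# Route 1: convert the cutoff to a z-score, then use the standard normal.
z = (1190 - sat.mean) / sat.stdev
p_at_least = 1 - NormalDist().cdf(z)

# Route 2: ask the SAT distribution directly, without a z-score.
p_direct = 1 - sat.cdf(1190)

print(f"z = {z:.3f}")
print(f"P(score >= 1190) = {p_at_least:.3f}")
```

Both routes give the same answer, roughly 0.85 under these parameters, because the cutoff lies about one standard deviation below the mean.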
What is the mean height of male adults in the US?
70.0 inches
What is the standard deviation of heights for male adults in the US?
3.3 inches
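With the mean and standard deviation above, the "shaded area" idea can be sketched for male heights, assuming they are normally distributed. The cutoffs of 66.7 and 73.3 inches are hypothetical, chosen as one SD either side of the mean so the result can be checked against the 68% figure from the Empirical Rule.

```python
from statistics import NormalDist

heights = NormalDist(mu=70.0, sigma=3.3)  # inches, from the cards above

# Hypothetical cutoffs: one SD below and one SD above the mean.
lower, upper = 66.7, 73.3

# Area between two cutoffs = lower tail at upper minus lower tail at lower.
p_between = heights.cdf(upper) - heights.cdf(lower)

print(f"P({lower} < height < {upper}) = {p_between:.3f}")  # about 0.683
```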