Chapter 3 Flashcards
(19 cards)
A numerical summary of a population of interest
population parameter
A numerical summary of a sample
statistic
is the middle value of the data set
median
The value that occurs most often in a data set
mode
is the class with the highest frequency
model class
is the average of the maximum and minimum of the data set.
midrange
It measures how far apart the most extreme values in a data set are
range
is the average squared distance that observations are from the mean
sample variance
is the average distance, or deviation, that observations are from the mean
sample standard deviation
measures the spread of a data set relative to its mean
coefficient of variation
describes the number of standard deviations that the value is above or below the average value.
z score or standard score
is the 25th percentile
first quartile Q1
is the 75th percentile
third quartile Q3
is the 50th percentile
median MD
denoted p1, p2. divide the data set into equal groups
percentiles
to depict the distribution of the data with a boxplot of the 5 number summary or with a stem and leaf plot
exploratory data analysis
a data set is composed of: minimum, Q1, MD, Q3, maximum
5 number summary
is a plot of the 5 number summary that depicts the middle half of the data with a box and the outside half of the data with lines
box plot
are identified as any number larger than Q3 + 1.5IQR or smaller than Q1 - 1.5IQR
outliers