chapter 3 Flashcards
(28 cards)
percentiles
measures of central tendency that divide a group of data into 100 parts
quartiles
measures of central tendency that divide a group of data into four subgroups or parts
differences between mean, median, and mode
mean: can apply only to quantitative data
median: can apply to ordinal-level qualitative data and quantitative data
mode: can apply to nominal-level qualitative and quantitative data (can be bimodal and multimodal)
advantages and disadvantages of mean
+ uses all the data and each data item influences the mean
- affected by outliers
advantages and disadvantages of median
+ not affected by outliers
- not all info from the numbers is used; only considers the middle value
advantages and disadvantages of mode
+ easy to determine
- data set my not have a mode
Is the arithmetic mean greatly affected by any extreme value or values? Explain.
Yes, the mean is affected by extreme data values.Consider the mean salary of employees. A CEO or executive making over $500,000 in a company will increase the mean salary of employees and therefore a mean salary is not indicative of most of the employees’ salary.
can you contrive a small set of data with no mode?
Any data set, regardless of size, with unique items in it will have no mode.
frequency distribution
summary of data presented in the firm of class intervals and frequencies
relative frequency
proportion of the total frequency that is in any given class interval in a frequency distribution
range
difference between the largest value and smallest value of a data set
is range affected by extreme values?
yes
interquartile range
range of values between the first and third quartiles (q3 - q1)
how to find deviations from the mean
each value minus the mean
mean absolute deviation
average of the absolute values of the deviations around the mean for a set of numbers
variance
the average of the squared deviations about the mean for a set of numbers
standard deviation
square root of the variance
empirical rule
used to state the approx percentage of values that lie within a given number of standard deviations from the mean of a set of approx normally distributed data (only used for 1o, 2o, and 3o)
empirical rule percentages
1o - 68% of data within one standard deviation of mean
20 - 95% within 2 standard deviations
3o - 99.7% within three standard deviations
chebyshev’s theorem
states that at least 1 - 1/k^2 values will fall within +/-k standard deviations of the mean regardless of the shape of the distribution
z score
represents the number of standard deviations by which a value is above or below the mean of a set of numbers when the data are normally distributed
coefficient of variation
the ratio of the standard deviation to the mean, expressed as a percentage, and is denoted CV
Can the values for the range, the interquartile range, the variance, and the standard deviation ever be negative? Explain.
The range and IQR can be negative but not the variance and standard deviations because the deviations for the latter two are squared.
What measure of variability can be used to compare variables when they have different units of measurement?
The coefficient of variation (denoted by CV) because it is based on the ratio, expressed as a percentage, of the standard deviation to the mean.Therefore, regardless of the unit of measure, comparisons may be made using the CV.