HW3 CH2 - Measures of Center, Variation, Five-Number Summary, Box Plots Flashcards
Define a measure of center
A descriptive measure that indicates the center, or most typical value, of a data set
What is a sample mean?
sum of all values divided by the total number of observations in the data set
how do you obtain the sample mean?
add up all the observations and divide by the number of observations
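A minimal sketch of that calculation in Python. The data values are made up for illustration and are not from the homework.

```python
# Hypothetical sample (assumed values, for illustration only)
data = [4, 8, 15, 16, 23, 42]

# Add up all the observations, then divide by how many there are
sample_mean = sum(data) / len(data)
print(sample_mean)  # 18.0
```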
what is the symbol for sample mean?
x̄ (“x-bar”), an x with a line above it
what is the symbol for population mean?
μ, the Greek letter “mu” (looks like a u with a tail)
what is a median?
A number that divides the top 50% of the data from the bottom 50%
how do you find the median?
sort the values from least to greatest; with an odd number of observations the median is the middle value, with an even number it is (sum of the two middle values)/2
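The odd and even cases can be sketched as follows. The function name `median` and the sample lists are assumptions chosen for illustration.

```python
def median(values):
    """Return the median of a non-empty list of numbers."""
    ordered = sorted(values)      # least to greatest
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd count: the single middle value
        return ordered[mid]
    # Even count: average of the two middle values
    return (ordered[mid - 1] + ordered[mid]) / 2

print(median([7, 1, 5]))     # 5
print(median([7, 1, 5, 3]))  # 4.0
```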
what is mode?
the value that occurs most often in the data set, provided its frequency is greater than 1; if no value repeats, the data set has no mode
A data set can have 2 or more modes. (T/F)
True
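A small sketch of the two mode cards above, assuming the "frequency > 1" rule. The helper name `modes` is hypothetical; it returns every value tied for the highest count, or an empty list when nothing repeats.

```python
from collections import Counter

def modes(values):
    """Return all modes, or [] if no value occurs more than once."""
    counts = Counter(values)
    highest = max(counts.values(), default=0)
    if highest < 2:
        # No value repeats, so by this definition there is no mode
        return []
    return sorted(v for v, c in counts.items() if c == highest)

print(modes([1, 2, 2, 3, 3]))  # [2, 3]  (two modes)
print(modes([1, 2, 3]))        # []      (no mode)
```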
what is a resistant measure?
a measure is robust (resistant) if extreme values have little to no influence on its outcome
which is a robust measure, the mean or the median?
median
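One way to see this is to add a single extreme value to a small made-up data set and compare how far each measure moves. The numbers below are assumptions for illustration.

```python
from statistics import mean, median

data = [10, 12, 13, 15, 20]      # hypothetical sample
with_outlier = data + [500]      # same sample plus one extreme value

# The mean is pulled strongly toward the outlier
print(mean(data), mean(with_outlier))      # 14 95

# The median barely changes
print(median(data), median(with_outlier))  # 13 14.0
```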
What are measures of variation (dispersion)?
descriptive measures that describe how much variation or “spread” there is in a data set
what is range?
The difference between the largest observation and the smallest observation
what are the disadvantages of range?
- measure is based only on 2 values
- not resistant: highly susceptible to outliers
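Both disadvantages show up in a short sketch with made-up data: only the largest and smallest values enter the calculation, and one outlier changes the result drastically.

```python
data = [10, 12, 13, 15, 20]   # hypothetical sample

# Range uses only two observations: the largest and the smallest
data_range = max(data) - min(data)
print(data_range)             # 10

# A single outlier dominates the range
with_outlier = data + [500]
outlier_range = max(with_outlier) - min(with_outlier)
print(outlier_range)          # 490
```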
what is deviation?
The difference between an observation and the mean
what is a sample standard deviation?
Roughly, the typical (average) distance between an observation and the mean; computed as the square root of the sum of squared deviations divided by n - 1
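A sketch tying deviations to the sample standard deviation, using assumed data. The manual steps are checked against `statistics.stdev` from the Python standard library, which also divides by n - 1.

```python
import math
from statistics import stdev

data = [2, 4, 4, 4, 5, 5, 7, 9]   # hypothetical sample
n = len(data)
x_bar = sum(data) / n             # sample mean

# Deviation: each observation minus the mean
deviations = [x - x_bar for x in data]

# Sum of squared deviations, divided by n - 1, then square-rooted
sample_sd = math.sqrt(sum(d ** 2 for d in deviations) / (n - 1))

# The manual result should agree with the library function
print(math.isclose(sample_sd, stdev(data)))  # True
```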
Is range resistant?
no
Does range show how spread out the data is?
Yes
is standard deviation robust?
no
Why transform data?
to change units, make the shape of the distribution more symmetric, or make the relationship between 2 variables linear
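Two of these reasons can be sketched with made-up numbers: a unit change (inches to centimeters) and a log transform that makes a right-skewed data set symmetric. The data sets are assumptions chosen so the effect is easy to see.

```python
import math
from statistics import mean, median

# Changing units: convert heights from inches to centimeters
heights_in = [60, 65, 70]                     # hypothetical heights
heights_cm = [h * 2.54 for h in heights_in]

# Making the shape symmetric: log-transform right-skewed data
skewed = [1, 10, 100, 1000, 10000]            # hypothetical values
logged = [math.log10(x) for x in skewed]

# Before: the mean sits far above the median (right skew)
print(mean(skewed), median(skewed))
# After: the mean and median of the logged values (approximately) coincide
print(mean(logged), median(logged))
```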
define parameter
numerical summary of the population
define statistic
numerical summary of the sample
define quartiles
the values that divide the ordered data set into 4 equal parts
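A hedged sketch using `statistics.quantiles` from the Python standard library on made-up data. Textbooks and software use several slightly different quartile conventions, so the method used in class may give different values for some data sets.

```python
from statistics import quantiles

data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]   # hypothetical sample

# n=4 asks for the three cut points that split the data into 4 parts
q1, q2, q3 = quantiles(data, n=4)
print(q1, q2, q3)  # 3.0 6.0 9.0
```

Here the second quartile, 6.0, is also the median of the data.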
What is the interquartile range?
the difference between the third and the