Basic Descriptive Statistics Flashcards
(55 cards)
what are statistics?
branch of applied mathematics that involves the collection, description, analysis, and inference of conclusions from quantitative data
what are descriptive stats?
summarizes or describes the characteristics of a data set
what are examples of descriptive stats?
measures of central tendency (mean, median, mode), measures of variability (spread), and frequency distribution (count)
inferential stats?
involves the use of a sample to estimate some characteristics in a large population and/or test a research hypothesis about a given population
primary data
acquired directly from source
secondary data (or archival data)
collected by someone else
nominal data and example
categories without any inherent order or meaningful sequence
ex: fruit colors
ordinal data and example
ranked data
ex: strongly agree to strongly disagree
interval data and example
data with ordered categories and equal intervals between them with no true zero point
ex: temperature
ratio data and example
data with ordered categories and equal intervals between them with a true zero
ex: height, weight, age, income
measures of central tendency definition
univariate statistic that indicates the average observed value of a variable in a data set or the center of the frequency distribution of the data set
what is the median?
variable in the middle, splitting is 50/50
if the distribution is symmetric, mean and median are same/different?
Same
if distribution almost symmetric, mean and median are ___
almost the same
if the distribution is skewed, the mean is pulled in the direction of…
the long tail
what does it mean to have have unstable median?
the median value of a dataset or distribution fluctuates or is sensitive to small changes in the data
is the median resistant to ouliers?
yes
what is mode
most frequent value (value not the frequency itself)
what type of data is mode most commonly used for?
nominal because it identifies the most frequent category
measures of dispersion
descriptive stats that describe how similar a set of scores are to each other
the most similar the scores are to each other, the ____ the measure of dispersion will be
lower
the less similar the scores are to each other, the ___ the measure of dispersion will be
higher
a taller curve has ___ dispersion, a flatter curve has ___ dispersion
less, more
when would you use range?
when you have ordinal data or you are presenting your results with little or no knowledge of stats
rarely used as it is fairly insensitive