Unit 1 Flashcards
date: 8.5 (31 cards)
categorical variable
describes but does not rank
quantitative variable
measures or ranks
What is a bar graph used for?
A bar graph is used to display and compare the frequency, count, or other measures for different categories.
True or False: A two-way table can show the relationship between two categorical variables.
True
Fill in the blank: A __________ plot is a graphical display that shows the distribution of a quantitative variable using dots.
dot
Short Answer: Describe a stemplot and its purpose.
A stemplot is a graphical method for displaying quantitative data in a way that retains the original data values while showing the distribution.
What type of data is best represented by a mosaic plot?
Categorical data
What is a histogram?
- A histogram is a graphical representation of the distribution of numerical data, showing the frequency of data points within specified ranges (bins).
- can display both continuous and discrete data
- bars on a histogram are adjacent
What term describes the overall shape of a distribution?
Distribution shape
True or False: The center of a distribution can be described using the mean, median, or mode.
True
Fill in the blank: The __________ of a distribution refers to the spread or dispersion of the data points.
variability
Which measure is NOT typically used to describe the center of a distribution: mean, median, mode, or range?
range
What do we call data points that lie significantly outside the overall pattern of a distribution?
outliers
What is the definition of standard deviation?
Standard deviation is a measure of the amount of variation or dispersion in a set of values.
low SD indicates that the data points are close to the mean
5-number summary
- minimum
- Q1
- median
- Q3
- maximum
- IQR
IQR
range of middle 50% (Q3-Q1)
a measure of variance
finding outliers
IQR x 1.5
What is a box plot used for?
A box plot is used to visually represent the distribution of a dataset, highlighting the median, quartiles, and potential outliers.
the box represents the IQR; the line inside the box rperesents median
doesn’t show mean
z-score
of standard deviations for a given x above/below the mean