Chapter 2 Flashcards
(30 cards)
Frequency table
Shows the count of cases in each category of a categorical variable
Proportion
Number of cases in a category / total number of cases (e.g., number who agree / total)
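A minimal sketch of the two cards above, using made-up survey responses (the data and variable names are assumptions for illustration only):

```python
from collections import Counter

# Hypothetical survey responses (invented for illustration)
responses = ["agree", "disagree", "agree", "neutral", "agree", "disagree"]

# Frequency table: count of cases in each category
frequency_table = Counter(responses)

# Proportion: count in one category divided by the total number of cases
proportion_agree = frequency_table["agree"] / len(responses)

print(frequency_table)
print(proportion_agree)
```

Here 3 of the 6 responses are "agree", so the proportion who agree is 3 / 6.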
Two way table
A table used to show the relationship between two categorical variables, with the categories of one variable as rows and the other as columns
Dotplot
A dot is placed above the axis at the value of each case in the dataset. Good for small datasets
Histogram
Groups quantitative data into bins. X axis is numeric
Bar charts
Shows the count or proportion in each category of a categorical variable as a bar. X axis is not numeric
Symmetric distribution
If folded over a vertical line, the two sides of the plot match closely
Skewed distribution
Data are piled up on one side of the graph, with a longer tail on the other side
Right skew
Data piled to the left, tail extends right
Left skew
Data piled to the right, tail extends left
Bell shaped distribution
Plot is symmetric and bell shaped
Resistance
A statistic is resistant if it is relatively unaffected by extreme values. The median is resistant, the mean is not
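A small sketch of resistance, using an invented dataset: adding one extreme value moves the mean a lot but barely moves the median.

```python
import statistics

# Hypothetical data (invented for illustration)
values = [10, 12, 13, 14, 16]
with_outlier = values + [100]  # same data plus one extreme value

mean_before = statistics.mean(values)
median_before = statistics.median(values)

mean_after = statistics.mean(with_outlier)
median_after = statistics.median(with_outlier)

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
```

The mean moves from 13 to 27.5, while the median only moves from 13 to 13.5.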
Outlier
Observed value is noticeably distant from other data points
Standard deviation
Measures the spread of the data in a sample: roughly how far values typically fall from the mean
The larger the standard deviation…
The more variability in the data
95% rule
If a distribution is symmetric and bell shaped, about 95% of the data fall within two standard deviations of the mean
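One way to check the 95% rule is with simulated bell-shaped data. This is only a sketch: the mean of 50, standard deviation of 10, sample size, and seed are arbitrary assumptions, and the result is approximate.

```python
import random
import statistics

random.seed(0)  # fixed seed so the simulation is repeatable

# Simulated bell-shaped data (mean 50 and SD 10 are arbitrary choices)
data = [random.gauss(50, 10) for _ in range(10_000)]

mean = statistics.mean(data)
sd = statistics.stdev(data)  # sample standard deviation

# Count the values within two standard deviations of the mean
lower, upper = mean - 2 * sd, mean + 2 * sd
within = sum(lower <= x <= upper for x in data)
fraction_within = within / len(data)

print(f"Fraction within 2 SDs: {fraction_within:.3f}")
```

The printed fraction should land close to 0.95. For skewed data the rule need not hold.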
Z score
Tells how many standard deviations a data value is from the mean and can determine how unusual it is
How to find a z score
(Value - mean) / standard deviation
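The formula above can be sketched as a short function. The dataset and the function name `z_score` are hypothetical, chosen so the arithmetic is easy to follow.

```python
import statistics

def z_score(value, mean, sd):
    """Number of standard deviations that value lies from the mean."""
    return (value - mean) / sd

# Hypothetical sample (invented for illustration)
sample = [8, 8, 10, 12, 12]

mean = statistics.mean(sample)  # 10
sd = statistics.stdev(sample)   # sample SD: sqrt(16 / 4) = 2

z = z_score(14, mean, sd)       # (14 - 10) / 2
print(z)
```

A value of 14 is two standard deviations above the mean here, so its z score is 2. By the 95% rule, z scores beyond about -2 or 2 are fairly unusual in bell-shaped data.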
Percentiles
The pth percentile is the value of a quantitative variable that is greater than p% of the data
Five number summary
Minimum, Q1, median, Q3, maximum. These five values divide the dataset into fourths
Interquartile range
Q3 - Q1
Range
Maximum - minimum
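A sketch of the five number summary, interquartile range, and range on an invented dataset. Quartile conventions differ between textbooks and software; this example assumes the "inclusive" method of Python's `statistics.quantiles`, which may not match the method used in the course.

```python
import statistics

# Hypothetical data (invented for illustration)
data = [1, 2, 3, 4, 5, 6, 7, 8, 9]

# Quartiles cut the sorted data into fourths.
# Assumption: the "inclusive" method; other methods can give other values.
q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")

five_number_summary = (min(data), q1, median, q3, max(data))
iqr = q3 - q1                       # interquartile range: Q3 - Q1
data_range = max(data) - min(data)  # range: maximum - minimum

print(five_number_summary)
print("IQR:", iqr, "Range:", data_range)
```

For this dataset the summary is 1, 3, 5, 7, 9, giving an IQR of 4 and a range of 8. A box plot draws exactly these five values.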
Box plot
Graphical display of the 5 number summary for a single quantitative variable
Side by side graph
Used when there is one categorical and one quantitative variable. Shows a separate graph (such as a box plot) of the quantitative variable for each group of the categorical variable