Chapter 3 & 4 Flashcards
(35 cards)
Name the displays/tables that represent quantitative data.
Stem and Leaf Display, Dotplots, and Histogram which is a Frequency Distribution Table.
Describe Dotplots Displays.
This plot was taken guessing the instructors age.

Name a general rule and difference between historgrams and others (Stem and Leaf plots and Dotplots).
General Rule: Historgrams are used for large data sets. Stem and Leaf Display and Dotplots are used for smaller data sets. Differences: Historgrams do not display raw data but displays frequency (counts). The “others” show raw data and frequency.
Name a general rule and difference between histograms and others (Stem and Leaf plots and Dotplots).
General Rule: Histograms are used for large data sets. Stem and Leaf Display and Dotplots are used for smaller data sets. Differences: Histograms do not display raw data but displays frequency (counts). The “others” show raw data and frequency.
Name a major difference between bar charts and histograms.
Bar charts have a required distance between each bar, whereas histograms do not have any space between each bar.
Humps in a histogram are called what?
Modes. Unimodal - one main peak Bimodal - two main peaks Multimodal -three or more peaks
A histogram is ___________ if you fold it along a vertical line through the middle and have edges match closely.
symmetric.
This describes how data are distributed.
shape of the data.
What measures shape of a distribution?
Skewness.
Here are 3 different shapes, name them in reference to skewness and symmetry.

- Left-Skewed 2. Symmetric 3. Right-Skewed
What are some standard notations used to describe distributions (histograms) numberically.
Mean, Standard Deviation, Variance, and Size.
Name the numerical data properties?
Central Tendency, Variation (Dispersion) and Shape. Central Tendency tells you where the middle is. Variation tells you how spread is the data.

What two tools measures of Central Tendency?
Mean and median.
Describe how the median is found.
Order the data from smallest to largest. If odd n, then middle value of sequence. If even n, then average of 2 middle values.
What 4 tools are used to measure variation.
Range, Interquartile Range, Variance, Standard Deviation.
This is a measure of noncentral tendency and splits ordered data into 4 quarters. What is it?
Quartiles.

Describe the method for finding quartiles.
If n is odd, the middle data is Q2. Then average the first two and final two for Q1 and Q3 respectfully. If n is even, average the two middle data for Q1. Split the entire data in half then the two middle data in each half for Q1 and Q3.
Describe how the interquartile range is found.
After finding the quartiles, Interquartile = Q3 - Q1.
This is a measure of dispersion, ignores how the data are distributed and is the difference between largest and smallest observations.
The Range.
These are measures of dispersion, the most common measures, considered how data are distributed and show variation about the mean.
Standard deviation and variance.
Sample variance defintional formula.
S2 = Σ(Yi - Yavg.)2/(n-1)
Measures the average squared distance that all the observations are from the mean.
The variance.
Sample Standard Deviation Formula.
S = (S2)1/2
Why is the standard deviation preferred over the variance?
The units of the variance are squared and could be misleading, where the standard deviation will hold proper units.



