Chapter 3 & 4 Flashcards

(35 cards)

1
Q

Name the displays/tables that represent quantitative data.

A

Stem and Leaf Display, Dotplots, and Histogram which is a Frequency Distribution Table.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Describe Dotplots Displays.

A

This plot was taken guessing the instructors age.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Name a general rule and difference between historgrams and others (Stem and Leaf plots and Dotplots).

A

General Rule: Historgrams are used for large data sets. Stem and Leaf Display and Dotplots are used for smaller data sets. Differences: Historgrams do not display raw data but displays frequency (counts). The “others” show raw data and frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Name a general rule and difference between histograms and others (Stem and Leaf plots and Dotplots).

A

General Rule: Histograms are used for large data sets. Stem and Leaf Display and Dotplots are used for smaller data sets. Differences: Histograms do not display raw data but displays frequency (counts). The “others” show raw data and frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Name a major difference between bar charts and histograms.

A

Bar charts have a required distance between each bar, whereas histograms do not have any space between each bar.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Humps in a histogram are called what?

A

Modes. Unimodal - one main peak Bimodal - two main peaks Multimodal -three or more peaks

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

A histogram is ___________ if you fold it along a vertical line through the middle and have edges match closely.

A

symmetric.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

This describes how data are distributed.

A

shape of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What measures shape of a distribution?

A

Skewness.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Here are 3 different shapes, name them in reference to skewness and symmetry.

A
  1. Left-Skewed 2. Symmetric 3. Right-Skewed
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are some standard notations used to describe distributions (histograms) numberically.

A

Mean, Standard Deviation, Variance, and Size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Name the numerical data properties?

A

Central Tendency, Variation (Dispersion) and Shape. Central Tendency tells you where the middle is. Variation tells you how spread is the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What two tools measures of Central Tendency?

A

Mean and median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Describe how the median is found.

A

Order the data from smallest to largest. If odd n, then middle value of sequence. If even n, then average of 2 middle values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What 4 tools are used to measure variation.

A

Range, Interquartile Range, Variance, Standard Deviation.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

This is a measure of noncentral tendency and splits ordered data into 4 quarters. What is it?

17
Q

Describe the method for finding quartiles.

A

If n is odd, the middle data is Q2. Then average the first two and final two for Q1 and Q3 respectfully. If n is even, average the two middle data for Q1. Split the entire data in half then the two middle data in each half for Q1 and Q3.

18
Q

Describe how the interquartile range is found.

A

After finding the quartiles, Interquartile = Q3 - Q1.

19
Q

This is a measure of dispersion, ignores how the data are distributed and is the difference between largest and smallest observations.

20
Q

These are measures of dispersion, the most common measures, considered how data are distributed and show variation about the mean.

A

Standard deviation and variance.

21
Q

Sample variance defintional formula.

A

S2 = Σ(Yi - Yavg.)2/(n-1)

22
Q

Measures the average squared distance that all the observations are from the mean.

A

The variance.

23
Q

Sample Standard Deviation Formula.

24
Q

Why is the standard deviation preferred over the variance?

A

The units of the variance are squared and could be misleading, where the standard deviation will hold proper units.

25
Variance Computational Formula.
s2 = (Σy2 - ((Σy)2/n)) / (n-1)
26
A \_-\_\_\_\_\_\_ _________ consists of the median, the quartiles, and the extremes of the data.
5-number summary.
27
What is a box plot?
A graphical display of data using 5-number summary.
28
Given these boxplots, determine the shape to the approiate boxplot.
From left to right: Left-Skewed, Symmetric, Right-Skewed.
29
When you describe a distribution, you should always start by talking about three things. What are they?
Shape, center, and spread.
30
The bins, together with these counts, give the _____________ of the quantitative variable and provide the building blocks for the display of the distribution, called a \_\_\_\_\_\_\_.
distribution, histogram.
31
In terms of central tendency and the existence of outliers, which a better tool to use the median or the mean? Why?
Median. It is not affected by outliers, the mean is.
32
In terms of variation and extreme values (which can present skewness), which is a better tool; range or I.Q.R.? Why?
I.Q.R., it is not effected by extreme values (outliers) or skewness. Range can be used when the spread (variation) is not effected by outliers.
33
Like the mean, the _______ \_\_\_\_\_\_\_\_\_\_ is appropriate only for symmetric data. One way to think about spread is to examine how far each data value is from the mean. The difference is call a \_\_\_\_\_\_\_\_\_\_.
standard deviaton, deviation.
34
Always pair the _______ with the IQR and the ______ with the standard deviation.
median, mean.
35
An outlier is an extreme value that differs greatly from other values in a set of values. As a "rule of thumb", an extreme value is considered to be an outlier if it is at least ___ interquartile ranges below the first quartile (Q1), or at least ___ interquartile ranges above the third quartile (Q3).
1.5 Supremum = 1.5(IQR) + Q3. Infimum = Q1 - 1.5(IQR). Therefore the outliers will be greater than the supremum and less than the infimum.