Chapter 2: Organizing and Summarizing Data Flashcards

(33 cards)

1
Q

What is frequency distribution?

A

Frequency Distribution lists each category of data and the number of occurrences for each category of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What does a frequency distribution table look like?

A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is relative frequency?

A

Relative Frequency is the proportion (or percent) of observations within a category and is found using the formula:

frequency/sum of all frequencies = relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

How is a relative frequency table organized?

A

A relative frequency distribution table lists each category of data with the relative frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a bar graph?

A

A bar graph is constructed by labeling each category of data on either the horizontal or vertical axis and the frequency or relative frequency of the category on the other axis.

Rectangles of equal width are drawn for each category. The height of each rectangle represents the category’s frequency or relative frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a Pareto Chart?

A

A Pareto chart is a bar graph whose bars are drawn in decreasing order of frequency or relative frequency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What graph should you use to compare data?

A

A side by side bar graph

Data sets should be compared using relative frequencies, because different sample or population sizes make comparisons using frequencies difficult or misleading.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Can Bar Graphs be drawn horizontally?

A

Yes, horizontal bars are preferred when category names are lengthy.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is a Pie Chart?

A

A pie chart is a circle divided into sectors. Each sector represents a category of data. The area of each sector is proportional to the frequency of the category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the first step to summarizing quantitative data on a display?

A

The first step in summarizing quantitative data is to determine whether the data are discrete or continuous.

*Discrete: can be counted *Continuous: is measured.

Discrete and few different values of the variable - the categories of the data (classes) will be the observations (as in qualitative data)

Continous Data - the categories of data (classes) must be created using intervals of numbers. Also applies if there are too many variables in discrete data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a histogram?

A

A histogram is constructed by drawing rectangles for each class of data.

The height of each rectangle is the frequency or relative frequency of the class.

The width of each rectangle is the same and the rectangles touch each other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What does constructing Frequency and Relative Frequency Distribution from Discrete Data look like?

A

EXAMPLE: The following data represent the number of available cars in a household based on a random sample of 50 households. Construct a frequency and relative frequency distribution.

3 0 1 2 1 1 1 2 0 2

4 2 2 2 1 2 2 0 2 4

1 1 3 2 4 1 2 1 2 2

3 3 2 1 2 2 0 3 2 2

2 3 2 1 2 2 1 1 3 5

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are classes?

A

Classes are categories into which data are grouped.

*When a data set consists of a large number of different discrete data values or when a data set consists of continuous data, we must create classes by using intervals of numbers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When reading a Data Table -

What is a Lower Class Limit?

What is a Upper Class Limit?

What is the Class Width?

A

lower class limit of a class is the smallest value within the class

upper class limit of a class is the largest value within the class

class width is the difference between consecutive lower class limits

Example from chart:

Lower class limit of 1st class = 25

Upper class limit of 1st class = 34

Class width between categories 35-25=10

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Example of organized continuous data on table?

A

Example of continuous data on table:

The following data represent the time between eruptions (in seconds) for a random sample of 45 eruptions.

The smallest data value is 672 and the largest data value is 738. We will create the classes so that the lower class limit of the first class is 670 and the class width is 10 and obtain the following classes:’

670 − 679

680 − 689 etc.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What are the guidelines for determining the Lower Class Limit of the First Class and Class Width?

A

Choosing the Lower Class Limit of the First Class:

Choose the smallest observation in the data set or a convenient number slightly lower than the smallest observation in the data set.

Determining the Class Width

Determine the class width by computing

Largest class number minus smallest class number divided by number of classes.

*round up to convenient number

17
Q

EXAMPLE Constructing a Frequency and Relative Frequency Histogram for Continuous Data

18
Q

What is a stem-and-leaf plot?

A

A stem-and-leaf plot uses digits to the left of the rightmost digit to form the stem. Each rightmost digit forms a leaf.

For example, a data value of 147 would have 14 as the stem and 7 as the leaf.

We let the stem represent the integer portion of the number and the leaf will be the decimal portion. For example, the stem of Alabama (2.8) will be 2 and the leaf will be 8

*Within each stem, rearrange the leaves in ascending order, title the plot, and include a legend to indicate what the values represent.*

19
Q

What does a split stem and leaf plot look like?

20
Q

What is an Advantage of Stem-and-Leaf Diagrams over Histograms?

A

Once a frequency distribution or histogram of continuous data is created, the raw data is lost (unless reported with the frequency distribution), however, the raw data can be retrieved from the stem-and-leaf plot.

21
Q

What is a dot plot?

A

A dot plot is drawn by placing each observation horizontally in increasing order and placing a dot above the observation each time it is observed.

EXAMPLE Drawing a Dot PlotThe following data represent the number of available cars in a household based on a random sample of 50 households. Draw a dot plot of the data.

3 0 1 2 1 1 1 2 0 2 4 2 2 2 1

2 2 0 2 4 1 1 3 2 4 1 2 1 2 2 3

22
Q

Identify the Shape of a Distribution

Uniform Distribution?

Bell-Shaped?

Skewed right?

Skewed Left?

A

Uniform distribution - the frequency of each value of the variable is evenly spread out across the values of the variable

Bell-shaped distribution - the highest frequency occurs in the middle and frequencies tail off to the left and right of the middle

Skewed right - the tail to the right of the peak is longer than the tail to the left of the peak

Skewed left - the tail to the left of the peak is longer than the tail to the right of the peak.

23
Q

What is a Time-Series Plot?

A

A time-series plot is obtained by plotting the time in which a variable is measured on the horizontal axis and the corresponding value of the variable on the vertical axis.

Line segments are then drawn connecting the points.

*If the value of a variable is measured at different points in time, the data are referred to as time series data

24
Q

What is descriptive statistics?

A

Descriptive statistics consists of organizing and summarizing data.

25
What is **inferential statistics**?
**Inferential statistics** uses methods that take a result from a​ sample, extend it to the​ population, and measure the reliability of the result.
26
What graph is common for representing **qualitative data**?
A b**ar graph**.
27
When comparing data sets, what is best to use? Relative Frequencies or frequencies?
**When comparing** data sets, it is **best to use relative frequencies** because different sample or population sizes make comparisons using frequencies difficult or misleading.
28
How do you **find the percentage for a sector of a pie chart?**
To find the percentage of a pie circle - **You must multiply the relative frequency by 360**. ## Footnote **Ex.** Relative frequency of 0.1091 x 360 = 39.2 percent.
29
What is a **cumulative frequency**?
A **cumulative frequency** distribution displays the aggregate frequency of the category. In other words, it displays the total number of observations less than or equal to the upper class limit of the class. **Example:** So the cumulative frequency for the second class is the sum of the frequencies of classes 1 and 2; the cumulative frequency for the third class is the sum of the frequencies of classes 1,2, and 3; and so on.
30
What is a **cumulative relative frequency?**
A **cumulative relative frequency** distribution displays the proportion (or percentage) of observations less than or equal to the upper class limit of the class. **Example looks like this**: *0.10* *\* to calculate add the relative frequencies in ascending form.*
31
What does a c**umulative frequency and relatitive cumulative frequency** chart look like?
It is just the frequencies and relative frequencies added together in ascending form to find a cumulative. \*note relative frequencies are found by dividing the frequency by number of observations.
32
What is an **ogive?** ## Footnote **How is a ogive constructed?**
An **ogive** (read as “oh jive”) is a graph that represents the cumulative frequency or cumulative relative frequency for the class. **How to:** A relative frequency ogive is drawn by plotting points whose x-coordinates are the upper class limit of each class and whose y-coordinates are the cumulative relative frequencies of each class. Then connect the points with line segments. Also, an additional line segment is drawn connecting the first point to the horizontal axis at a location representing the upper limit of the class that would precede the first class (if it existed).
33