Formulas to remember Flashcards

1
Q

Formula for the width of a class in a histogram

A

(Largest value - Smallest Value) / (Number of Classes)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the reccomended amount of classes for a histogram?

A

2^k > n

To determine if our choices are valid, we can also follow the rule of 2 to the k larger than n.

The first value of k that brings it above n is the number of classes we should choose

It is reccomended that it be between 5 and 20

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between Histograms and Bar Charts?

A

The histogram is specifically for class intervals and numbers that we can add together to create cumulative frequency distributions. These are specific to quantitative data. Earlier, we depicted sleep hours in a histogram.

In contrast, bar charts deal with categories, classes, and labels, such as nominal and ordinal data. Earlier, we depicted student motivation for a major as a bar chart.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Histogram

A

A histogram displays:

Class intervals
Class frequencies

Histograms are appropriate for quantitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Bar Chart

A

A bar chart displays:

Frequencies for a set of categories or classes
Bar charts are better for qualitative data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Ogive

A

Definition: Graphical representation showing cumulative frequency through class midpoints.

Calculation: Plot cumulative frequency against class midpoints.

Cumulative Nature: Frequency increases cumulatively.

Endpoint: Typically stops at 100% frequency.

Purpose: Alternative data visualization.

Example: First class interval 25-31, midpoint 28, plotted on ogive.

Application: Analyzing cumulative trends in data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Stem and Leaf Plot

A

Definition: Technique for visualizing data distribution by showing stems (leading digits) and leaves (trailing digits).

Purpose: Provides a snapshot of data distribution, aiding in quick understanding.

Procedure:
Choosing Stem: Select stem (leading digit) based on data range.

Adding Leaves: For each stem, list leaves (trailing digits) in ascending order.

Rounding Data: Sometimes necessary to round data to fit display, particularly when using larger leaf units.

Application: Used in descriptive statistics to illustrate data distribution effectively.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Calculating the Median

A

Calculating the Median

Step 1: Sort Values
Arrange values in ascending or descending order.

Step 2: Verify Odd/Even
Determine if the dataset has an odd or even number of values.

Step 3: Split Dataset
Divide the dataset into two equal halves.

Step 4: Round Up
Round the split position to the nearest integer.

Conclusion: The value at the rounded-up position represents the median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q
A
How well did you know this?
1
Not at all
2
3
4
5
Perfectly