Exam 3 Flashcards

Descriptive Statistics, Frequency Distributions and Frequency Distribution Tables, Measures of Central Tendency, and Measures of Variability.

1
Q

Frequency Distribution

A

Is an organized tabulation of the number of individuals in each category on the scale of measurement.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Absolute Frequency (f)

A

The number of participants that fall in each category.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Relative Frequency (rf)

A

The proportion of participants that fall in each category.

-f/N where N=total number of scores

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Percent

A

rf*100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cumulative Frequency (cf)

A

The number of people that score AT or LOWER than a given score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Cumulative Relative Frequency (crf)

A
  • Use rf column

- Proportion of people that scored AT or BELOW a given score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Cumulative Percent

A

crf*100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Real Limits and Frequency Distributions

A

Begin 1 unit below lowest score and end 1 unit above highest score.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Frequency Graphs

A
  • Grouped or Ungrouped (interval) Scores

- Used when you have discrete variables

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Frequency Histogram

A

No spaces or gaps between bars.

-Used when the data consist of numerical scores that have been measured on an interval or ratio scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Frequency Polygon

A

Dots connected by a continuous line that begins and ends on the x-axis.
-Used when the data consist of numerical scores from an interval or ratio scale.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Bar Graph

A

Much like a histogram, except there are spaces between the bars.
-Used with nominal and ordinal scales.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Symmetrical Distribution

A

It is possible to draw a line down the middle so that one side of the distribution is a mirror image of the other.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Skewed Distribution

A

The scores tend to pile up toward one end of the scale and taper off gradually at the other end.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Positively Skewed Distribution

A

When the scores pile up on the left side of the distribution. The tail points toward the positive end of the x-axis (right).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Negatively Skewed Distribution

A

When the scores pile up on the right side of the distribution. The tail points toward the negative end of the x-axis (left).

17
Q

Line Plot

A

Same as polygon, but left and rightmost points are NOT closed at the abscissa.

18
Q

Stem and Leaf

A

Obtains the raw data in the display.

19
Q

Tukey’s Tallies

A

Does not maintain raw data.

20
Q

Heuristic for Grouping Scores

A

1) Determine the number of groups
- 5-15
2) Determine the size (width) of the interval (2,3, or some multiple of 5)
- Highest score-Lowest score
- Divide by number of groups
- Round to nearest commonly used interval size
3) Determine the beginning lowest interval (should be a multiple of width)

21
Q

Measures of Central Tendency

A

Mode, Median, Mean

22
Q

Mode

A

Most commonly occurring score.
Advantages: score that actually occurs in the data set, unaffected by extreme scores, represents most common observation
Disadvantage: may not be representative of entire distribution of scores

23
Q

Median

A

Divides the distribution into equal halves.
(N+1)/2=Median Location
Advantages: takes into account all the data in the distribution, unaffected by extreme scores (use when distributions are skewed), use if have missing values in data set, open-ended distribution, use with an ordinal scale
Disadvantages: value may not exist in the data, does not enter readily into equations and more difficult to work with, treats all scores alike; differences in magnitude not taken into account

24
Q

Mean

A

Arithmetic average of scores.
Advantages: representative of every score in the distribution, closely related to variance and standard deviation
Disadvantages: affected by extreme scores or “outliers” when you have only a limited number of scores in your distribution, value may not exist in data

25
Central Tendency
An attempt to find a single score that represents the center of a distribution. Strives to find a number representative of the whole distribution.
26
Variability
A qualitative measurement of the differences between scores and the degree to which they are different. Are they spread out or clustered?
27
Biased Variance
An average value of the statistic that is equal to the population parameter.
28
Unbiased Variance
An average value of the statistic that either over or underestimates the corresponding population parameter.
29
Sum of Squares
How each score varies from the mean. Advantage: takes into account ALL the scores in a distribution. Disadvantage: size of SS depends on the amount of variability and is influenced by N.
30
Population Variance
Average of the SUM of squared deviations from the mean (Mean Squared Distance). -Takes into account N.
31
Population Standard Deviation
Represents an average distance or direction from the mean.
32
Sample Variance
Average SUM of squared deviations from the mean. | -Takes into account n.
33
Standard Deviation
Represents an average distance or direction from the mean.