Unit 1 - Exploring Data Flashcards

(35 cards)

1
Q

What are 2 measures of center?

A

Mean and median

2
Q

What are 3 measures of spread?

A

Standard Deviation, Interquartile Range, Range

3
Q

What is the difference between quantitative and categorical data?

A

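Quantitative data = numerical measurements or counts (averaging them makes sense)

Categorical data = labels that place individuals into groups (usually words, sometimes number codes such as ZIP codes)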

4
Q

What graphs are appropriate for categorical data?

A

Pie charts and bar charts

5
Q

What graphs are appropriate for quantitative data?

A

Histograms, box plots, stem plots, dot plots.

6
Q

Which graphs retain individual observations?

A

Stem plots and dot plots
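
To see why a stem plot retains individual observations, here is a minimal sketch that builds one as text. The function name `stem_plot` and the quiz scores are made up for illustration, and the sketch assumes non-negative whole numbers with the tens digit as the stem and the ones digit as the leaf.

```python
from collections import defaultdict

def stem_plot(values):
    """Return stem plot lines for non-negative whole numbers (stem = tens, leaf = ones)."""
    stems = defaultdict(list)
    for v in sorted(values):
        stems[v // 10].append(v % 10)
    lines = []
    # Include every stem between the smallest and largest, even if it has no leaves.
    for stem in range(min(stems), max(stems) + 1):
        leaves = "".join(str(leaf) for leaf in stems.get(stem, []))
        lines.append(f"{stem} | {leaves}")
    return lines

scores = [72, 85, 91, 68, 85, 77, 94, 70]  # made-up quiz scores
for line in stem_plot(scores):
    print(line)
```

Every original score can be read back from the plot (for example, stem 7 with leaf 2 is 72), which a histogram or box plot does not allow.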

7
Q

About how many bars should your histogram have?

A

About 5 or 6

8
Q

When describing a distribution, what 3 things should you include?

A

Shape, center, spread

9
Q

A distribution that has approximately the same frequency for each data value is ___________.

A

Uniform

10
Q

A distribution that has a greater frequency of large values is __________________.

A

Left Skew

11
Q

A distribution that has a greater frequency of small values is _________________.

A

Right Skew

12
Q

The mean is pulled in the direction of the _______.

A

Skew and outliers

13
Q

In a left skewed distribution, the median is ___________ than the mean.

A

greater

14
Q

In a unimodal and symmetrical distribution, the mean is _________________ the median.

A

about the same as
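
The relationship between mean and median in cards 12 to 14 can be checked on small data sets. This is only a sketch: both lists are made up, with `left_skewed` holding mostly high values plus a tail of low ones.

```python
from statistics import mean, median

# Made-up left-skewed data: most values are high, with a tail of low values.
left_skewed = [35, 60, 88, 90, 92, 94, 95]
# Made-up symmetric, unimodal data.
symmetric = [1, 2, 2, 3, 3, 3, 4, 4, 5]

# The low tail pulls the mean below the median.
print(mean(left_skewed) < median(left_skewed))

# With no skew, the mean and median agree.
print(mean(symmetric), median(symmetric))
```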

15
Q

Define statistic

A

A number that describes a sample (such as the sample mean, sample median, or sample minimum)

16
Q

Define parameter

A

A number that describes a population (such as the population mean or population range)
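
A short sketch of the statistic versus parameter distinction, assuming a hypothetical population made of the whole numbers 1 to 100. The seed is fixed only so the sketch is repeatable; the names `mu` and `x_bar` follow the usual symbols for population mean and sample mean.

```python
import random
from statistics import mean

random.seed(1)  # fixed seed so the sketch is repeatable
population = list(range(1, 101))        # hypothetical population: the values 1..100
sample = random.sample(population, 10)  # simple random sample of 10 values

mu = mean(population)  # parameter: describes the whole population
x_bar = mean(sample)   # statistic: describes only the sample
print(mu, x_bar)
```

The parameter is a single fixed number, while the statistic would change if a different sample were drawn.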

17
Q

What is a 5 number summary?

A

Min, Q1, Median, Q3, Max
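
A minimal sketch of computing a 5 number summary. It assumes one common classroom convention: Q1 and Q3 are the medians of the lower and upper halves of the sorted data, leaving out the middle value when the count is odd. Software packages often use other quartile rules, so results can differ slightly. The function name and the prices are made up.

```python
from statistics import median

def five_number_summary(data):
    """Min, Q1, median, Q3, max, with Q1 and Q3 as medians of the lower and upper halves."""
    xs = sorted(data)
    half = len(xs) // 2
    lower = xs[:half]            # values below the median position
    upper = xs[len(xs) - half:]  # values above the median position
    return min(xs), median(lower), median(xs), median(upper), max(xs)

prices = [1.50, 1.75, 2.00, 2.35, 2.50, 2.75, 3.25]  # made-up lunch prices
print(five_number_summary(prices))  # (1.5, 1.75, 2.35, 2.75, 3.25)
```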

18
Q

What is the formula for calculating the upper boundary for outliers?

A

Q3 + 1.5(IQR)

19
Q

What is the formula for calculating the lower boundary for outliers?

A

Q1 - 1.5(IQR)
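
The two boundary formulas in cards 18 and 19 can be applied together as in the sketch below. The quartile rule (medians of the lower and upper halves) is an assumption, and `outlier_fences` and the data are made up for illustration.

```python
from statistics import median

def outlier_fences(data):
    """Return (lower boundary, upper boundary) using the 1.5 * IQR rule."""
    xs = sorted(data)
    half = len(xs) // 2
    q1 = median(xs[:half])            # median of the lower half
    q3 = median(xs[len(xs) - half:])  # median of the upper half
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr

data = [2, 4, 5, 6, 7, 8, 30]  # made-up values
low, high = outlier_fences(data)
outliers = [x for x in data if x < low or x > high]
print(low, high, outliers)  # -2.0 14.0 [30]
```

Here Q1 = 4 and Q3 = 8, so IQR = 4 and the boundaries are 4 - 6 = -2 and 8 + 6 = 14. Only 30 falls outside them.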

20
Q

For what kind of distributions do we prefer to use the median and IQR?

A

Skewed distributions

21
Q

What measure of center and spread do we use if a distribution is approximately symmetrical?

A

Mean and Standard Deviation

22
Q

What does it mean if a statistic is resistant?

A

The statistic is not easily influenced or changed by skew and outliers

23
Q

What does it mean if a statistic is non-resistant?

A

The statistic is easily influenced and changed by skew and outliers

24
Q

Name 2 resistant statistics.

A

Median, Interquartile Range

25
Q

Name 2 non-resistant statistics.

A

Mean, Standard Deviation

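Resistance can be seen by adding one extreme value to a small data set and comparing how much each statistic moves. The numbers here are made up, so treat this as a sketch.

```python
from statistics import mean, median, stdev

data = [10, 12, 13, 15, 15]  # made-up values
with_outlier = data + [95]   # same data plus one extreme value

# The median barely moves: it is resistant.
print(median(data), median(with_outlier))  # 13 14.0

# The mean and standard deviation both jump: they are non-resistant.
print(mean(data), mean(with_outlier))
print(stdev(data), stdev(with_outlier))
```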
26
Q

The median of a data set of the price of school lunch items is $2.35. Interpret this number.

A

About half of the school lunch items cost less than $2.35 and about half cost more.

27
Q

The standard deviation of a data set of the price of school lunch items is $0.48. Interpret this number.

A

School lunch prices typically differ from the mean school lunch price by about $0.48.

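Strictly, the sample standard deviation is the square root of the sum of squared deviations from the mean divided by n - 1, which is why it is read as a typical distance rather than a literal average. A minimal sketch with made-up prices, checked against `statistics.stdev`:

```python
from math import sqrt
from statistics import mean, stdev

prices = [2.0, 2.5, 3.0, 3.5, 4.0]  # made-up lunch prices
x_bar = mean(prices)

# Squared distance of each price from the mean.
squared_devs = [(x - x_bar) ** 2 for x in prices]

# Sample standard deviation: divide by n - 1, then take the square root.
s = sqrt(sum(squared_devs) / (len(prices) - 1))
print(s, stdev(prices))  # both are about 0.79
```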
28
Q

Which graphs are most appropriate for large data sets, or data with a large range of numbers?

A

Histograms and box plots

29
Q

What is the symbol for sample mean?

A

x-bar (x̄)

30
Q

What is the symbol for population mean?

A

Mu (μ)

31
Q

What is the symbol for sample standard deviation?

A

s

32
Q

What is the symbol for population standard deviation?

A

Sigma (σ)

33
Q

If you add or subtract the same value to every number in a data set, what happens to the measures of location (mean, median, min, max, percentiles)?

A

They all shift by that same value.

34
Q

If you add or subtract the same value to every number in a data set, what happens to the measures of spread (range, IQR, S.D.)?

A

They stay the same.

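A quick check of cards 33 and 34 on made-up data: adding 10 to every value moves the measures of location by 10 and leaves the measures of spread alone.

```python
from math import isclose
from statistics import mean, median, stdev

data = [2, 4, 4, 6, 9]            # made-up values
shifted = [x + 10 for x in data]  # add the same value to every number

# Measures of location shift by 10.
print(mean(data), mean(shifted))      # 5 15
print(median(data), median(shifted))  # 4 14

# Measures of spread do not change.
print(max(data) - min(data), max(shifted) - min(shifted))  # 7 7
print(isclose(stdev(data), stdev(shifted)))                # True
```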
35
Q

If you multiply or divide every number in a data set by the same value, which summary statistics will change?

A

All of them: mean, median, range, IQR, S.D., etc.
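
As a sketch with made-up data, multiplying every value by 3 multiplies both the measures of location and the measures of spread by 3.

```python
from math import isclose
from statistics import mean, median, stdev

data = [2, 4, 4, 6, 9]          # made-up values
scaled = [x * 3 for x in data]  # multiply every number by the same value

print(mean(data), mean(scaled))      # 5 15
print(median(data), median(scaled))  # 4 12
print(max(data) - min(data), max(scaled) - min(scaled))  # 7 21
print(isclose(stdev(scaled), 3 * stdev(data)))           # True
```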