Stats M. 2 Flashcards

(47 cards)

1
Q

relative frequency distribution

A

listing of distinct values and their relative frequencies (proportions or percentages)

Numerical summary for categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Bar Chart

A

Graphical Summary for categorical data
-bars do not touch each other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Pareto diagram

A

graphical summary for categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

pie chart

A

graphical summary for categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mean

A

sum of observations divided by the number of observations

numerical summary for quantitative data

sensitive to/affected by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

median

A

the number that divides the bottom 50% of the data from the top 50% of the data

numerical summary for quantitative data

not sensitive to/not affected by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

mode

A

any value that occurs with the greatest frequency

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

percentiles

A

indicate the point below which a certain percentage of observations fall`

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

quartiles

A

special type of percentile that divides data into quarters

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Q1

A

25%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Q2

A

median- 50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Q3

A

75%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

standard deviation

A

tells us whether the observations within the data set tend to be close to the mean or far away from the mean

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

IQR

A

the difference between Q3 and Q1
tells us about the variability of the middle 50%

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

range

A

difference between the maximum and minimum value

numerical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Dotplot

A

graphical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

histogram

A

graphical summary for quantitative data

Bars touch

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

density plot

A

graphical summary for quantitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

box plot

A

graphical summary for quantitative data
5 number summary (minimum, Q1, median, Q3, maximum)

20
Q

time plots

A

graphical summary for quantitative data

21
Q

S.O.C.S

A

Shape
Outliers
Center
Spread

22
Q

Shape

A

Unimodal, bimodal, multimodal
skewness or symmetrical
-left skewed= tail goes to negative side
-right skewed=tail goes to positive side

23
Q

Outliers

A

unusual values

24
Q

Center
-symmetric + no outliers

A

Report the mean

25
Center -skewed +/or outliers
report the median
26
Spread -symmetric + no outliers
report the standard deviation
27
Spread -skewed +/or outliers
report the IQR
28
Comparative graphical displays (quantitative + categorical)
SOCS Histogram + box plot
29
Bivariate Data
data that contains 2 variables
30
Association (Bivariate Data)
a relationship between two variables
31
Response Variable (Bivariate Data)
measured to make comparisons between groups
32
Explanatory Variable (Bivariate Data)
explains the value of the response variable
33
contingency table
a frequency distribution for bivariate data (also called a two-way or cross-tabulation table)
34
conditional proportions (Bivariate Data)
proportions based on the explanatory variable for the categories of the response variable (divide each cell count by the corresponding row total)
35
No association (Bivariate Data)
values (%) within each column or bar heights of same color are similar
36
Yes association (Bivariate Data)
values (%) within each column or bar heights of same color are different
37
comparative bar chart
a chart that compares the conditional proportion of the response variable within each category of the explanatory variable
38
Mosaic plots
another comparative chart
39
Scatterplots
summarize bivariate quantitative data
40
Positive Association (bivariate quantitative data)
as values of one variable increase, so do values of the other
41
Negative association (bivariate quantitative data)
as values of one variable increase, values of the other variable decrease
42
No association (bivariate quantitative data)
no apparent relationship between the two variables
43
correlation
measure of the strength and direction of the linear relationship between two variable
44
weak correlation
positive: 0 < r < 0.4 negative: -0.4 < r < 0
45
moderate correlation
positive: 0.4 < r < 0.8 negative: -0.8 < r < -0.4
46
strong correlation
positive: 0.8 < r < 1 negative: -1 < r < -0.8
47