Chapter 3 Flashcards

(46 cards)

1
Q

Descriptive Statistics

A

Summarize or describe relevant characteristics of data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Mean

A

Average

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Σx

A

Sum of all data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Median

A

Middle value when all data are set in numerical order (count)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Mode

A

Value that occurs with the greatest frequency in a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Bimodal

A

Two modes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Multimodal

A

More than 2 modes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

No mode

A

No data value is repeated

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Midrange

A

Largest value+minimum data value/2

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Rounding rule

A

Round to one place greater than the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Nominal level data

A

Doesn’t make sense to measure center numbers

ranks, zip codes, things that aren’t measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Mean from a frequency distribution

A
Sum of all class midpoints/sum of frequencies
x̅=Σ(f*x)/Σf
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

f

A

Frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

x

A

Class midpoint for frequency distribution, value in weighted mean, frequencies in s, magic in σ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Weighted Mean

A

data contributes more significance than another: break it down

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Weighted mean formula

A

x̅=Σ(w*x)/Σw

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Skewed distribution

A

Data plot is more on one side than the other

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

Skewed to the left

A

Negatively skewed

20
Q

Skewed to the right

A

Positively skewed

21
Q

Symmetric Data

A

Zero skewness: mean, median, mode are same

22
Q

Range

A

Largest value-smallest value

23
Q

Standard Deviation for a sample (s)

A

Measure of variation of values about the mean.
s=√ nΣ(x^2)-(Σx)^2/n(n-1)
n=#values
x=frequencies

24
Q

Standard Deviation for a population (σ)

A
Measure of variation of values about the mean.
σ=√Σ(x-μ)^2/N
N=pop. size
μ=mean of pop
x=some magic # you pull out of your ass
25
Variance
s^2, σ^2 s^2 tends to be close to σ^2, making s^2 an unbiased predictor of σ^2. But difficult to understand caz different that original unit.
26
Rule of thumb
95% of data lies between 2 SD of the mean
27
Estimate Min & max data values
x̅-(2*s), x̅+(2*s)
28
Estimate SD
s=range/4
29
Empirical rule for bell shaped
68% of data falls within 1SD of mead, 95% 2SD, 99.7% 3SD
30
Chebyshev's Theorum
For any distribution the proportion of data values lying with K SD of the mean is always at least 1-1/K^2, where K is any positive #>1 K=SD from mean
31
s
sample SD
32
σ
Pop. SD
33
s^2
Sample variance
34
σ^2
Pop. variance
35
<< SD
Values in data set are close together
36
>>SD
Values in data set have large variation
37
Z score (standardized value)
The # of SD that a given value x is above or below the mean. | z=x-x̅/s or z=x-μ/σ
38
Usual z scores
-2 < Z SCORE < or equal to 2 | Unusual data is called outlier data
39
Percentiles
Relative position of a data value compared to the data set in 100 groups. Data is _% BELOW a #
40
Percentile of x equation
x=100(#values below x)/(total# values)
41
Quartiles
Divides group into 4 parts Q1=P25, Q2=P50, Q3+P75
42
Interquartile range (IQR)
IQR=Q3-Q1
43
5 number summary box plot
Minimum, Maximum, median, Q1, Q3
44
Outliers
Data above Q3 or below Q1 by an amount > 1.5 IQR
45
Estimate range
min=x̅-(2*s), max=x̅+(2*s) OR range=s*4
46
Coefficient of variation
s/x̅*100, σ/μ*100 described sd relative to mean