Section 2 (Pgs 20-25) Flashcards

(47 cards)

1
Q

What are the 3 most common measures of central tendency?

A

Mean
Median
Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Most common measures of spread or variability? (4)

A

Standard deviation
Variance
Range
Semi-interquartile range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Symbol for mean of population?

A

µ (mu)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Symbol for standard deviation of population?

A

σ (sigma)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Symbol for mean of a sample?

A

x̅ (ex bar)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Symbol for standard deviation of a sample?

A

s

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the mean?

A

The sum of data values divided by the number of data values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How do you calculate the mean from a frequency table?

A

Sum of frequencies multiplied by the midpoint of the group divided by total number of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How would you describe the sample mean as an estimator of the population mean?
Why?

A

Unbiased estimator

The mean of all possible sample means than can be selected from a population is equal to the population mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Why is the mean an efficient summary statistic?

A

It uses all the data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a negative point about using the mean?

A

It is sensitive to extreme values so would not be used to summarise data with extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the median?

A

The middle value when the data is ranked in numerical order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

If the mean and median are the same, what does this say about the data?

A

The distribution is symmetrical and there are no extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

When is the median more appropriate to use than the mean as a measure of central tendency?

A

When there are extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Is the median affected by extreme values?

A

No

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the mode?

A

The most frequently occurring value in a data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the mode useful for?

A

It is the only measure of centre for qualitative data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

Is the mode sensitive to extreme values?

A

No, however it is wasteful of the data as it only uses one observation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

When is the mean used?

A

As the measure of centre for quantitative data, unless the distribution of the data is skewed

20
Q

When is the median used?

A

As the measure of centre for qualitative data when the distribution of the data is skewed/ there are extreme values

21
Q

When is the mode used?

A

As the measure of the centre for qualitative data

22
Q

What does the variance and standard deviation measure?

A

The spread of the data above the mean

23
Q

How is the standard deviation related to the variance?

A

It is the square root of the variance

24
Q

What is the variance?

A

The average of the squared deviations for the mean

25
How do you calculate the variance from a frequency table?
Sum of frequencies multiplied by midpoint - mean squared, divided by n -> same as variance equation with f before bracket
26
How would you describe the use of sample variance to estimate the population value?
Biased | Average of all sample variances is not equal to the population value
27
What does the population variance equal, in terms of the sample variance?
Sample variance X n/ n-1
28
How can the variance equation be changed to ensure that the sample variance provides an unbiased estimate of the population value?
n-1 is used as the denominator
29
How does the standard deviation relate to the spread of data?
The larger the standard deviation, the wider the spread of the data
30
If the standard deviation = 1, how many standard deviations would you expect 95% of the data to lie within?
2
31
What would a standard deviation of 0 mean?
There is no variation in the data -> all data is the same
32
What is the range a measure of?
The extremes (not the variability) -> not used very often as a measure of spread
33
What is the inter-quartile range?
The difference between the first and third quartiles
34
Is the inter-quartile range affected by extreme values?
No (it only includes the middle 50% of observations)
35
What is the most commonly used measure of spread of data about the mean for continuous data and most discrete observations?
Standard deviation
36
What is used preferentially to the standard deviation when there are outlying observations?
Inter-quartile range
37
What is the word used to describe data that is not symmetrical?
Skewed
38
What is it called when most values lie towards the bottom of the range and there is a tail to the right?
Positively skewed
39
What is it called when most values lie towards the top fo the range and there is a tail to the left?
Negatively skewed
40
Are positively or negatively skewed data more common?
Positively - negatively skewed data is rare
41
What does the coefficient of skewness do?
Indicates if the data is symmetrical or positively or negatively skewed
42
What does the coefficient of skewness equal if the data is symmetrical?
0
43
What does a coefficient of skewness greater than 0 indicate?
Positive skewness
44
What does a coefficient of skewness less than 0 indicate?
Negative skewness
45
What is kurtosis?
A measure of the peakedness of a distribution
46
What does a value of 0 for the kurtosis indicate?
A shape close to the normal distribution
47
What does a positive value for the kurtosis indicate?
A relatively peaked distribution