W2 Flashcards

1
Q

What are summary statistics?

A

Different values that tell us about the centre and spread of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the median (med)?

A

Centre of a set of data. Can be one piece of data or the average of two.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the median used to best measure?

A

Central tendency for data that’s skewed or has outliers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What’s the formula for finding where the median lies?

A

(n/2) + 0.5 when n = number of pieces of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does ‘n’ equal when finding summary statistics?

A

Number of pieces of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the mean also known as?

A

Average.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is the mean represented and pronounced?

A

‘x’ with a horizontal line above it. Pronounced ‘x bar’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the mean best to measure?

A

Central tendency for symmetrical data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How do you find the mean?

A

Add data and divide by ‘n’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What’s the range used for?

A

To measure the spread of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How do you find the range?

A

Highest piece of data minus the lowest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What’s the Quartile 1 (Q1)?

A

The middle of the half of data below the median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What’s the Quartile 3 (Q3)?

A

The middle of the half of data above the median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What does the interquartile range measure (IQR)?

A

Th spread of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

How do you find the IQR?

Interquartile range.

A

Q3 minus Q1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What’s the mode? Can there be many?

A

The most occurring value. There can be several in a set of data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

For central tendency with symmetrical data consisting of no outliers, should you find the median or mean?

A

Either.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

For central tendency with skewed data consisting of outliers, should you find the median or mean?

A

Median.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What’s the symbol for sample standard deviation?

A

Lowercase ‘s’.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is sample standard deviation used for?

A

To figure out the percent of data in our sample that is a certain distance from the mean.

21
Q

What’s a parameter describing?

A

The data from a population.

22
Q

What’s a statistic describing?

A

The data from a sample.

23
Q

What kind of a curve is normal distribution?

A

A density curve.

24
Q

What is normal distribution AKA?

A

Bell curve or normal curve.

25
Q

How many peaks in normal distribution?

A

One.

26
Q

What’s a unimodal?

A

When there’s a single peak.

27
Q

Is normal distribution symmetrical?

A

Yes.

28
Q

What’s the symbol for standard deviation population?

A

σ

29
Q

What’s the symbol for mean population?

A

μ

30
Q

What does standard deviation population characterise?

A

The spread of normal distribution.

30
Q

What does the mean population characterise?

A

The position of normal distribution.

31
Q

How does a large standard deviation impact the curve?

A

It becomes more spread out with a flatter curve.

32
Q

How does a small standard deviation impact the curve?

A

It becomes less spread out with a taller curve.

33
Q

Where do you use population standard deviation?

A

When calculating the intervals on standard deviation.

34
Q

What’s the 68-95-99.7 rule?

A

In normal distribution when two intervals from the mean are 68% of the data. Four intervals are 95%. Six intervals are 99.7%.

35
Q

What does the 68-95-99.7 rule apply to?

A

Any normal distribution.

36
Q

Does normal distribution ever touch the ‘x’ axis?

A

No it continues forever.

37
Q

What is defined as an outlier?

A

Data that’s above the Upper Fence or below the Lower Fence.

38
Q

How do you find the Upper Fence?

A

Q3 + 1.5 x IQR

39
Q

How do you find the Lower Fence?

A

Q1 - 1.5 x IQR

40
Q

What’s included in a 5 number summary?

A

Min, Q1, med, Q3 and max

41
Q

What type of data is best to display in a boxplot?

Numerical/categorical

A

Numerical.

42
Q

What are parallel box plots used for?

A

Displaying two or more sets of data on the same number line/axis.

43
Q

What do you refer to when comparing the central tendency of parallel box plots?

A

Medians.

44
Q

What do you refer to when comparing the spread of parallel box plots?

A

The IQR or range.

45
Q

What are back-to-back stem plots used for?

A

Comparing two sets of the same sort of data using the same set of stems.

46
Q

How do you compare the central tendency of a back-to-back stem plot, if one set of data is skewed?

A

By referencing the different medians.

47
Q

How do you compare the central tendency of a back-to-back stem plot, if both sets of data are skewed?

A

By referencing the different means.

48
Q
A