Lecture 2- Summarizing data: Descriptive statistics Flashcards

1
Q

What are the three order of measurement of average?

A

First order: Measures of central tendency

  • Second Order: Measures of dispersion
  • Third order: Measures of skewness and kurtosis
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are measures of central tendency?

A

-yield information about

“particular places or locations in a group of numbers.”

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the different types of central tendency?

A
  • Mean
  • Mode
  • Median
  • Percentiles
  • Quartitles
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the arithmetic mean?

A
  • Mean is the average of a group of numbers
  • Applicable for interval and ratio data only
  • Affected by extreme outliers which reduce its reliability
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the mode and the different types of mode?

A
  • Most frequently occurring dataset
  • Applicable for all levels of data

-Bimodal:– In a tie for the most frequently occurring value, two
modes are listed

-Multimodal: Data sets that contain more than two modes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the median and how to work it out?

A
  • Middle number in an order array of numbers
  • If N is odd (N+1)/2
  • If even so they have the same amount of number each side of the dataset
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are quartiles and examples of them

A

• Quartile - measures of central tendency that divide a group of data into four
subgroups

  • Q1: 25% of the data set is below the first quartile
  • Q2: 50% of the data set is below the second quartile
  • Q3: 75% of the data set is below the third quartile
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How to work out quartiles?

A
  • Find a quarter of the number in the dataset for the 1st
  • FInd half of the number for Q2
  • Find 3/4 of the number for Q3
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the measures of dispersion?

A

-tools that describe the
spread or the dispersion of a set of data.

–Range 
– Inter-quartile Range 
–Variance 
– Standard Deviation
–Coefficient of Variation
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the range?

A

-The difference between the largest and the smallest
values in a set of data

Advantage:– easy to compute (ignore how the data
is distributed)

Disadvantage:is affected by extreme values

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Inter-quartile range?

A
  • range of values between the first and third quartiles

- Q3-Q1 to find the IQR

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How to work out deviation?

A

-Subtracting the mean from each data value gives the
deviation from the mean (X - µ)

• An examination of deviation from the mean can reveal
information about the variability of the data

-The Sum of Deviation from the arithmetic mean is always zero: Sum (X - µ) = 0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the standard deviation and the variance?

A

add photo

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

formula for variance and standard deviation

A

add photo

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does the standard deviation show?

A

-A low standard deviation means that most of the numbers are very close to the average.

• A high standard deviation means that the numbers are spread out.

• A standard deviation closer to zero, indicates that the points are
closer to the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the empirical rule?

A

-68% of the population members lie within one standard deviation of the
mean

-approximately 95% lie within two standard deviations
of the mean

-approximately 99.7% lie within three standard deviations of the mean.

+ and - the mean

17
Q

What is the coefficient of variation (CV)?

A

Ratio of the standard deviation to the mean, expressed as a percentage

-useful when comparing Std Dev computed
from data with different means

• Measurement of relative dispersion

18
Q

What is the formula for coefficient of variation?

A

CV=standard deviation/mean x 100

19
Q

Measures of shape

A

Symmetrical – the right half is a mirror image of the
left half

• Skewness – shows that the distribution lacks
symmetry; used to denote the data is sparse at one
end, and piled at the other end
– Absence of symmetry
– Extreme values in one side of a distribution

20
Q

Coefficient of skewness

A

formulas for skewness

21
Q

When do you know if something is positive, negative or not skewed?

A

If Sk < 0, the distribution is negatively skewed (skewed to the left).

  • If Sk = 0, the distribution is symmetric (not skewed).
  • If Sk > 0, the distribution is positively skewed (skewed to the right).
22
Q

Mean, median mode of skewness

A
  • If the mean is greater than the mode, the distribution is positively skewed.
  • If the mean is less than the mode, the distribution is negatively skewed.
  • If the mean is greater than the median, the distribution is positively skewed.
  • If the mean is less than the median, the distribution is negatively skewed.
23
Q

What are the different types of kurtosis?

A

Peakedness of a distribution
• Leptokurtic: high and thin
• Mesokurtic: normal in shape
• Platykurtic: flat and spread out

ADD IMAGE