Class 3 Spring 🌷 Flashcards

1
Q

What are the three measures of Central Tendency?

A

Mean, median, mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the measures of Dispersion?

A

Range, IQR, variance, standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What type of data is the mode primarily used for?

A

Categorical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the definition of β€˜mode’?

A

The value with the most occurrences in the data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the formula to calculate the range?

A

Highest number - Lowest number

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is variance represented by?

A

sΒ²

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the relationship between variance and standard deviation?

A

Standard deviation is the square root of variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What does the Interquartile Range (IQR) measure?

A

Dispersion related to the median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is the median represented in a Box-and-Whisker plot?

A

A dark line denoting the median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What percent of data falls between Q1 and the median in a Box-and-Whisker plot?

A

50%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is a characteristic of a right (positive) skewed distribution?

A

Tail on the right side

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Fill in the blank: The _______ is the average squared distance from the mean.

A

Variance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the typical distribution of data within one standard deviation of the mean?

A

About 70%

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What type of data is IQR used with?

A

Numerical data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What does a Box-and-Whisker plot’s whiskers represent?

A

Data outside of the box attempting to capture the spread

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

True or False: Outliers are defined by hard-and-fast rules.

A

False

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is the purpose of identifying outliers in data?

A

Useful for various reasons in statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What are the shapes/modalities that a distribution can have?

A

Uniform, unimodal, bimodal, multimodal

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What does skewness describe in a dataset?

A

Asymmetry of the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

Common examples of right skewed data include _______.

A

People’s incomes, house prices, number of accident claims

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What is the measure of centrality that is primarily used for numerical data?

A

Mean and median

22
Q

What is the main question to answer when describing a dataset regarding central tendency?

A

Where is the β€˜middle’ of the dataset?

23
Q

What is the primary measure of dispersion for categorical data?

24
Q

What does the term β€˜deviation’ refer to in statistics?

A

Distance from the mean

25
What is the first step in building a Box-and-Whisker plot?
Drawing a line denoting the median
26
Fill in the blank: The _______ is the typical deviation of observations from the mean.
Standard deviation
27
What percent of data typically falls within two standard deviations of the mean?
About 95%
28
What statistical notation is used for the standard deviation of a sample?
s
29
What is a common method to visualize median and IQR?
Box-and-Whisker plots
30
What is the significance of the first and third quartiles in a Box-and-Whisker plot?
They define the boundaries of the box representing the middle 50% of the data
31
What is a common example of right/positively skewed data?
People's incomes ## Footnote Other examples include mileage on used cars, reaction times, house prices, and number of accident claims.
32
What is a common example of left/negatively skewed data?
Number of fingers ## Footnote Most people have ten fingers, but some may lose one or more. The age at death in wealthy countries is also negatively skewed.
33
What are two top choices for visualizing skewed data?
* Histograms * Box-and-whisker plots
34
In a skewed distribution, where does the mode typically lie?
Under the peak of the distribution.
35
What happens to the mean in a skewed distribution?
The mean gets pulled in the direction of the skew.
36
What is the relationship between skewness and the difference between the mean and median?
The greater the skewness, the greater the difference between the mean and the median.
37
If the data are skewed, which measure of central tendency may not provide a good estimate?
The mean.
38
Fill in the blank: The median and IQR are only sensitive to numbers near _______.
Q1, the median, and Q3.
39
What is the interquartile range (IQR)?
A measure of statistical dispersion.
40
Which measure is likely more useful for understanding a typical individual loan?
The median.
41
Which measure is likely more useful for understanding the total amount needed for 1,000 loans?
The mean.
42
True or False: In very skewed data, the mean provides a good estimate of the data center.
False.
43
What happens to the mean and median in right-skewed data?
Median < Mean.
44
What happens to the mean and median in left-skewed data?
Mean < Median.
45
What is the summary statistic for centrality of data in symmetrical data?
Mean.
46
What is the summary statistic for data spread?
Standard deviation.
47
What statistical tools may not be usable with skewed data?
* t-test * ANOVA
48
What does the median represent in skewed data?
A better estimate of the center than the mean.
49
What is a characteristic of robust statistics in relation to skewness?
They are stable in the presence of extreme observations.
50
What are examples of potentially skewed datasets?
* Sea Turtle Sizes * Stats Test Scores * Swim Times