Chapter 3 Vocabulary Flashcards

1
Q

Arithemetic mean

A

The arithmetic mean of a variable is computed by adding all the values of the
variable in the data set and dividing by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Population arithmetic mean

A

The population arithmetic mean, μ, is computed using all the individuals in a population and is a parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Sample arithmetic mean

A

The sample arithmetic mean, x , is computed using sample data and is a statistic.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Mean

A

Although other types of means exist, the arithmetic mean is generally referred to as the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Median

A

The median of a variable is the value that lies in the middle of the data when arranged in
ascending order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Resistant

A

A numerical summary of data is said to be resistant if extreme values (very large or small) relative to the data do not affect its value substantially.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Mode

A

The mode of a variable is the most frequent observation of the variable that occurs in the data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

No mode

A

If no observation occurs more than once, we say the data have no mode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Bimodal

A

If a set of data has two values of the variable that occur with the most frequency, we say the
data set is bimodal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Multimodal

A

if a data set has three or more data values that occur with the highest frequency, the data
set is multimodal.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Dispersion

A

is the degree to which the data are spread out.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Range

A

The range, R, of a variable is the difference between the largest and smallest data value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Deviation about the Mean

A

For a population, the deviation about the mean for the ith observation is
xi – μ. For a sample, the deviation about the mean for the i-th observation is xi - x bar .

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Population standard deviation

A

The population standard deviation, σ, of a variable is the square root of the sum of squared deviations about the population mean divided by the number of observations in the population, N. That is, it is the square root of the mean of the squared deviations about the population mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Sample standard deviation

A

The sample standard deviation, s, of a variable is the square root of the sum of squared deviations about the sample mean divided by the n – 1, where n is the sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Population variance

A

The population variance is the square of the population standard deviation and is denoted σ^2.

17
Q

Sample variance

A

The sample variance is the square of the sample standard deviation and is denoted s^2.

18
Q

Degrees of freedom

A

For the sample standard deviation, this refers to the fact that we divide by n – 1 to compute standard deviation rather than n. We call n – 1 the degrees of freedom because the first n – 1 observations have freedom to be whatever value they wish, but the nth value has no freedom. It must be whatever value forces the sum of the deviations about the mean to equal zero.

19
Q

Bias

A

Whenever a statistic consistently underestimates a parameter, it is said to be biased.

20
Q

The Empirical Rule

A

f a distribution is roughly bell shaped, then (a) Approximately 68% of the data will lie within 1 standard deviation of the mean. (b) Approximately 95% of the data will lie within 2 standard deviations of the mean. (c) Approximately 99.7% of the data will lie within 3 standard deviations of the mean.

21
Q

Weighted mean

A

The weighted mean of a variable is found by multiplying each value of the variable by its corresponding weight, adding these products, and dividing this sum by the sum of the weights.

22
Q

z‐score

A

The z‐score represents the distance that a data value is from the mean in terms of the number of standard deviations.

23
Q

kth percentile

A

The kth percentile, denoted Pk, of a set of data is a value such that k percent of the observations are less than or equal to the value.

24
Q

Quartiles

A

divide data sets into fourths, or four equal parts.

25
Q

Interquartile range

A

The interquartile range, IQR, is the range of the middle 50% of the observations in a data set. That is, the IQR is the difference between the first and third quartiles and is found using the formula IQR = Q3 – Q1.

26
Q

Describe the distribution

A

Describe the distribution means to describe its shape (skewed left, skewed right, symmetric), its center (mean or median), and its spread (standard deviation or interquartile range).

27
Q

Outlier

A

An outlier is an extreme observation in a data set.

28
Q

Fences

A

Fences serve as cutoff points for determining outliers.

29
Q

Five‐number summary

A

The five‐number summary of a set of data consists of the smallest data value, Q1, the median, Q3, and the largest data value.

30
Q

Boxplot

A

Graphical summary of quantitative data used to identify shape of a distribution and outliers.

31
Q

Whiskers

A

The lines drawn from Q1 to the smallest data value that is larger than the lower fence and from Q3 to the largest data value that is smaller than the upper fence in a boxplot.