Chapter 3: Descriptive Statistics: Numerical Measures Flashcards

(50 cards)

1
Q

sample statistics

A

a numerical value used as a summary measure for a sample (ex. the sample mean, x bar, the sample variance, s^2, and the sample standard deviation, s)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

population parameter

A

a numerical value used as a summary measure for a population (ex. the population mean, u, the population variance, o^2, and the population standard deviation, o)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

point estimator

A

a sample statistic, such as x bar, s^2, and s, used to estimate the corresponding population parameter.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

mean

A

a measure of central location computed by summing the data values and dividing by the number of observations.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

median

A

a measure of central location provided by the value in the middle when the data are arranged in ascending order.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

mode

A

a measure of central location, defined as the value that occurs with greatest frequency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

weighted mean

A

the mean obtained by assigning each observation a weight that reflects its importance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

geometric mean

A

a measure of location that is calculated by finding the nth root of the product of n values.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

percentile

A

a value that provides information about how the data are spread over the interval from the smallest to the largest value

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

pth percentile

A

for a data set containing n observations, the pth percentile divides the data into two parts: approximately p% of the observation are less that the pth percentile and approximately (100-p)% of the observations are greater than the pth percentile.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

quartiles

A

the 25th, 50th, and 75th percentiles, referred to as the first, second (median), and third quartile, respectively. the quartiles can be used to divide a data set into four parts, with each part containing approximately 25% of the data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

range

A

a measure of variability, defined to be the largest value minus the smallest value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

interquartile range (IQR)

A

a measure of variability, defined to be the difference between the third and first quartiles

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

variance

A

a measure of variability based on the squared deviations of the data values about the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

standard deviation

A

a measure of variability computed by taking the positive square root of the variance.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

coefficient of variation

A

a measure of relative variability computed by dividing the standard deviation by the mean and mulptiplying by 100

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

skewness

A

a measure of the shape of a data distribution. data skewed to the left=negative skewness. skewed to the right=positive skewness. symmetrical data=zero skewness

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

z-score

A

a value computed by dividing the deviation about the mean (xi-x bar) by the standard deviation s. a z-score is referred to as a standardized value and denotes the number of standard deviations xi is from the mean.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

chebyshev’s theorem

A

a theorem that can be used to make statements about the proportion of data values that must be within a specified number of standard deviations of the mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

emperical rule

A

a rule that can be used to compute the percentage of data values that must be within one, two and three standard deviations of the mean for the data that exhibit a bell-shaped distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

outlier

A

an unusually small or large data value

22
Q

five number summary

A

a technique that uses 5 numbers to summarize the data: smallest value, 1st quartile, median, 3rd quartile, largest value

23
Q

box plot

A

a graphical summary of data based on a 5 number summary

24
Q

covariance

A

a measure of linear association between two variables. positive values=positive relationship. negative values=negative relationship.

25
correlation coefficient
a measure of linear association between two variables that takes on values between -1 and +1. values near +1 indicate a strong positive linear relationship. values near -1 indicate a strong negative linear relationship. and values near zero lack any liner relationship.
26
a numerical value used as a summary measure for a sample (ex. the sample mean, x bar, the sample variance, s^2, and the sample standard deviation, s)
sample statistics
27
a numerical value used as a summary measure for a population (ex. the population mean, u, the population variance, o^2, and the population standard deviation, o)
population parameter
28
a sample statistic, such as x bar, s^2, and s, used to estimate the corresponding population parameter.
point estimator
29
a measure of central location computed by summing the data values and dividing by the number of observations.
mean
30
a measure of central location provided by the value in the middle when the data are arranged in ascending order.
median
31
a measure of central location, defined as the value that occurs with greatest frequency.
mode
32
the mean obtained by assigning each observation a weight that reflects its importance.
weighted mean
33
a measure of location that is calculated by finding the nth root of the product of n values.
geometric mean
34
a value that provides information about how the data are spread over the interval from the smallest to the largest value
percentile
35
for a data set containing n observations, the pth percentile divides the data into two parts: approximately p% of the observation are less that the pth percentile and approximately (100-p)% of the observations are greater than the pth percentile.
pth percentile
36
the 25th, 50th, and 75th percentiles, referred to as the first, second (median), and third quartile, respectively. the quartiles can be used to divide a data set into four parts, with each part containing approximately 25% of the data.
quartiles
37
a measure of variability, defined to be the largest value minus the smallest value.
range
38
a measure of variability, defined to be the difference between the third and first quartiles
interquartile range (IQR)
39
a measure of variability based on the squared deviations of the data values about the mean.
variance
40
a measure of variability computed by taking the positive square root of the variance.
standard deviation
41
a measure of relative variability computed by dividing the standard deviation by the mean and mulptiplying by 100
coefficient of variation
42
a measure of the shape of a data distribution. data skewed to the left=negative skewness. skewed to the right=positive skewness. symmetrical data=zero skewness
skewness
43
a value computed by dividing the deviation about the mean (xi-x bar) by the standard deviation s. a z-score is referred to as a standardized value and denotes the number of standard deviations xi is from the mean.
z-score
44
a theorem that can be used to make statements about the proportion of data values that must be within a specified number of standard deviations of the mean
chebyshev's theorem
45
a rule that can be used to compute the percentage of data values that must be within one, two and three standard deviations of the mean for the data that exhibit a bell-shaped distribution.
emperical rule
46
an unusually small or large data value
outlier
47
a technique that uses 5 numbers to summarize the data: smallest value, 1st quartile, median, 3rd quartile, largest value
five number summary
48
a graphical summary of data based on a 5 number summary
box plot
49
a measure of linear association between two variables. positive values=positive relationship. negative values=negative relationship.
covariance
50
a measure of linear association between two variables that takes on values between -1 and +1. values near +1 indicate a strong positive linear relationship. values near -1 indicate a strong negative linear relationship. and values near zero lack any liner relationship.
correlation coefficient