
Chapter 3: Describing, Exploring, and Comparing Data Flashcards

(57 cards)

1
Q

Measure of Center

A

the value at the center or middle of a data set

2
Q

Arithmetic Mean (Mean)

A

the measure of center obtained by adding the values and dividing the total by the number of values. It is what most people call an average.

3
Q

median

A

the middle value when the original data values are arranged in order of increasing (or decreasing) magnitude

4
Q

Are median values affected by extreme values?

A

No. The median is a resistant measure of center, so extreme values do not change it substantially.

5
Q

How is the median found for a data set that has an odd number of values?

A
1. Sort the values.
2. The median is the number located in the exact middle of the list.
6
Q

How is the median found for a data set that has an even number of values?

A
1. Sort the values.
2. The median is found by computing the mean of the two middle numbers.
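The odd and even procedures for the median can be sketched in a few lines of Python. This is only an illustration: the function name `median` and the sample lists are assumptions, not part of the deck.

```python
def median(values):
    """Sort the values, then return the middle value (odd count)
    or the mean of the two middle values (even count)."""
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("median of an empty data set is undefined")
    mid = n // 2
    if n % 2 == 1:
        return ordered[mid]                       # exact middle of the sorted list
    return (ordered[mid - 1] + ordered[mid]) / 2  # mean of the two middle values


print(median([7, 1, 5]))     # 5   (sorted: 1, 5, 7)
print(median([7, 1, 5, 3]))  # 4.0 (sorted: 1, 3, 5, 7; mean of 3 and 5)
```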
7
Q

mode

A

the value that occurs with the greatest frequency

8
Q

bimodal

A

two data values occur with the same greatest frequency

9
Q

multimodal

A

more than two data values occur with the same greatest frequency

10
Q

no mode

A

no data value is repeated
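The mode, bimodal, multimodal, and no-mode cases can be told apart by counting frequencies. The sketch below assumes the deck's definitions; the helper name `modes` and the sample lists are made up for illustration.

```python
from collections import Counter


def modes(values):
    """Return the values that occur with the greatest frequency.
    An empty list means there is no mode (no value is repeated)."""
    counts = Counter(values)
    if not counts:
        return []
    top = max(counts.values())
    if top == 1:
        return []  # no data value is repeated, so there is no mode
    return sorted(v for v, c in counts.items() if c == top)


print(modes([1, 2, 2, 3]))           # [2]        one mode
print(modes([1, 1, 2, 2, 3]))        # [1, 2]     bimodal
print(modes([1, 1, 2, 2, 3, 3, 4]))  # [1, 2, 3]  multimodal
print(modes([1, 2, 3]))              # []         no mode
```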

11
Q

Which measure of central tendency can be used with nominal data?

A

Only mode

12
Q

midrange

A

the value midway between the maximum and minimum values in the original data set

13
Q

How is the midrange calculated?

A

(max value + min value)/2

14
Q

What is the range of a set of data values?

A

The difference between the max data value and the min data value.

15
Q

How is the range of a set of data values calculated?

A

range = (max value) - (min value)
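Both the midrange and the range depend only on the maximum and minimum values. A minimal sketch follows; the function names and the sample data are illustrative assumptions.

```python
def data_range(values):
    """range = (max value) - (min value)"""
    return max(values) - min(values)


def midrange(values):
    """midrange = (max value + min value) / 2"""
    return (max(values) + min(values)) / 2


data = [2, 9, 4, 15]     # made-up sample
print(data_range(data))  # 13
print(midrange(data))    # 8.5
```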

16
Q

What is the standard deviation of a set of sample values?

A

A measure of how much data values deviate from the mean.

17
Q

Can the value of a standard deviation be negative?

A

No. A standard deviation is never negative; it is zero only when all the data values are equal.

18
Q

What units are standard deviations expressed in?

A

The units are the same as the units of the original data values.

19
Q

What is the range rule of thumb for understanding standard deviation?

A

For many data sets, the vast majority (such as 95%) of sample values lie within two standard deviations of the mean.

20
Q

How are “usual” values in a data set determined using the range rule of thumb?

A

Usual values lie between (mean) - 2*(standard deviation) and (mean) + 2*(standard deviation).

21
Q

Using the range rule of thumb, how are standard deviations roughly estimated from a collection of known samples?

A

standard deviation ≈ range/4
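The two uses of the range rule of thumb (estimating a standard deviation as range/4, and finding the limits of "usual" values as mean ± 2 standard deviations) can be sketched as below. The function names and numbers are hypothetical, chosen only to make the arithmetic easy to follow.

```python
def estimate_std_dev(values):
    """Rough estimate of the standard deviation: range / 4."""
    return (max(values) - min(values)) / 4


def usual_limits(mean, std_dev):
    """Limits separating usual values from unusual ones:
    mean - 2 * std_dev and mean + 2 * std_dev."""
    return mean - 2 * std_dev, mean + 2 * std_dev


print(estimate_std_dev([10, 14, 18, 22, 30]))  # 5.0  (range 20 divided by 4)
print(usual_limits(100, 15))                   # (70, 130)
```

The estimate is crude by design; it is meant for a quick sanity check, not as a replacement for computing the standard deviation.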

22
Q

variance

A

a measure of variation equal to the square of the standard deviation

23
Q

Why is the sample variance s^2 an unbiased estimator of the population variance?

A

because the values of s^2 tend to target the value of population variance instead of systematically tending to overestimate or underestimate population variance.
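One way to see what "unbiased" means is to list every possible sample and average the resulting values of s^2. The sketch below assumes a tiny made-up population (1, 2, 5) and samples of size 2 drawn with replacement. It uses exact fractions so the comparison is not blurred by rounding.

```python
from fractions import Fraction
from itertools import product


def population_variance(values):
    """Divide the sum of squared deviations by N."""
    n = len(values)
    mean = Fraction(sum(values), n)
    return sum((x - mean) ** 2 for x in values) / n


def sample_variance(values):
    """s^2: divide the sum of squared deviations by n - 1."""
    n = len(values)
    mean = Fraction(sum(values), n)
    return sum((x - mean) ** 2 for x in values) / (n - 1)


population = [1, 2, 5]                         # hypothetical population
samples = list(product(population, repeat=2))  # all 9 samples of size 2, with replacement
mean_of_s2 = sum(sample_variance(s) for s in samples) / len(samples)

print(mean_of_s2)                       # 26/9
print(population_variance(population))  # 26/9
```

Individual samples give values of s^2 ranging from 0 to 8, but their average equals the population variance exactly, which is what "targets the population variance" means.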

24
Q

What is the empirical rule?

A

For data sets having a distribution that is approximately bell shaped, the following properties apply: ~68% of all values fall within 1 standard deviation of the mean; ~95% of all values fall within 2 standard deviations of the mean; ~99.7% of all values fall within 3 standard deviations of the mean.

25
What is Chebyshev's Theorem?
The proportion (or fraction) of any set of data lying within K standard deviations of the mean is always at least 1 - 1/K^2, where K is any positive number greater than 1.
26
Using Chebyshev's Theorem, what does K=2 mean?
At least 3/4 (or 75%) of all values lie within 2 standard deviations of the mean.
27
Using Chebyshev's Theorem, what does K=3 mean?
At least 8/9 (or 89%) of all values lie within 3 standard deviations of the mean.
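The K=2 and K=3 results follow directly from the bound 1 - 1/K^2. A short sketch, with `chebyshev_lower_bound` as an illustrative name, reproduces them with exact fractions:

```python
from fractions import Fraction


def chebyshev_lower_bound(k):
    """Minimum proportion of values within k standard deviations
    of the mean, for any data set: 1 - 1/k^2 (requires k > 1)."""
    if k <= 1:
        raise ValueError("K must be greater than 1")
    return 1 - 1 / Fraction(k) ** 2


print(chebyshev_lower_bound(2))  # 3/4
print(chebyshev_lower_bound(3))  # 8/9
```

Unlike the empirical rule, this bound makes no assumption about the shape of the distribution, which is why its guarantees (75%, about 89%) are weaker than 95% and 99.7%.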
28
variance
a measure of variation equal to the square of the standard deviation
29
Why is the sample variance s^2 an unbiased estimator of the population variance?
because the values of s^2 tend to target the value of population variance instead of systematically tending to overestimate or underestimate population variance.
30
What is the empirical rule?
For data sets having a distribution that is approximately bell shaped, the following properties apply: ~68% of all values fall within 1 standard deviation of the mean; ~95% of all values fall within 2 standard deviations of the mean; ~99.7% of all values fall within 3 standard deviations of the mean.
31
What is Chebyshev's Theorem?
The proportion (or fraction) of any set of data lying within K standard deviations of the mean is always at least 1 - 1/K^2, where K is any positive number greater than 1.
32
Using Chebyshev's Theorem, what does K=2 mean?
At least 3/4 (or 75%) of all values lie within 2 standard deviations of the mean.
33
Using Chebyshev's Theorem, what does K=3 mean?
At least 8/9 (or 89%) of all values lie within 3 standard deviations of the mean.
34
What is the coefficient of variation?
For a set of nonnegative sample or population data, the coefficient of variation (CV), expressed as a percent, describes the standard deviation relative to the mean.
35
How is the coefficient of variation calculated?
CV = (standard deviation / mean) * 100%
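As a minimal sketch of the coefficient of variation, the code below assumes sample data and therefore uses the sample standard deviation (`statistics.stdev`). The function name and data are illustrative.

```python
import statistics


def coefficient_of_variation(values):
    """CV as a percent: (standard deviation / mean) * 100.
    Assumes sample data, so the sample standard deviation is used."""
    mean = statistics.mean(values)
    if mean == 0:
        raise ValueError("CV is undefined when the mean is 0")
    return statistics.stdev(values) / mean * 100


# Mean is 20 and the sample standard deviation is 10, so CV is 50%.
print(coefficient_of_variation([10, 20, 30]))
```

Because the CV has no units, it allows the variation of data sets measured on different scales to be compared.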
36
What are measures of relative standing?
Numbers showing the location of data values relative to the other values within a data set.
37
In which ways are measures of relative standing used?
To compare values from different data sets or to compare values within the same data set.
38
Name 4 examples of measures of relative standing.
z scores, percentiles, quartiles, and boxplots
39
What is a z score?
The number of standard deviations that a given value x is above or below the mean.
40
How are z scores determined?
z = (x - mean) / (standard deviation); round the z score to 2 decimal places
41
Whenever a value is less than the mean, is its z-score positive or negative?
Negative
42
What are ordinary z-score values?
-2 <= z score <= 2
43
What are unusual z-score values?
z score < -2 or z score > 2
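The z score formula and the ordinary/unusual cutoffs at -2 and 2 can be sketched as follows. The mean of 100 and standard deviation of 15 are assumed values used only for the example.

```python
def z_score(x, mean, std_dev):
    """Number of standard deviations x lies above (+) or below (-)
    the mean, rounded to 2 decimal places."""
    return round((x - mean) / std_dev, 2)


def is_unusual(z):
    """Unusual values have z < -2 or z > 2; ordinary values have -2 <= z <= 2."""
    return z < -2 or z > 2


for x in (130, 85, 60):
    z = z_score(x, mean=100, std_dev=15)
    print(x, z, "unusual" if is_unusual(z) else "ordinary")
# 130 2.0 ordinary
# 85 -1.0 ordinary
# 60 -2.67 unusual
```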
44
What are percentiles?
Measures of location that divide a set of data into 100 groups with about 1% of the values in each group.
45
How is the percentile of a data value found?
percentile of x = (number of values less than x / total number of values) * 100
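A short sketch of this calculation, with a made-up data set and an illustrative function name:

```python
def percentile_of_value(values, x):
    """(number of values less than x / total number of values) * 100"""
    below = sum(1 for v in values if v < x)
    return 100 * below / len(values)


data = [2, 4, 6, 8, 10, 12, 14, 16, 18, 20]  # made-up sample
print(percentile_of_value(data, 14))         # 60.0 (6 of the 10 values are below 14)
```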
46
How is a percentile converted to a data value?
L = (k/100) * n, where n = total number of values in the data set, k = percentile being used, L = locator that gives the position of a value in the sorted data, and Pk = kth percentile
47
How is Pk found when L is a whole number?
By adding the Lth value and the next value and dividing the total by 2.
48
How is Pk found when L is not a whole number?
By rounding L up to the next larger whole number; Pk is the value at that position in the sorted data.
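The locator procedure (compute L = (k/100) * n, then either average two neighbouring values or round L up) can be sketched as below. The function name, the restriction to whole-number k from 1 to 99, and the sample data are assumptions made for illustration.

```python
def kth_percentile(values, k):
    """Return Pk using the locator L = (k/100) * n on the sorted data.
    Assumes k is a whole number from 1 to 99."""
    if not 1 <= k <= 99:
        raise ValueError("k must be between 1 and 99")
    ordered = sorted(values)
    n = len(ordered)
    if n == 0:
        raise ValueError("data set is empty")
    # Integer arithmetic avoids floating-point error when checking
    # whether L = k * n / 100 is a whole number.
    whole, remainder = divmod(k * n, 100)
    if remainder == 0:
        # L is whole: average the Lth value and the next value
        # (positions are 1-based, list indexes are 0-based).
        return (ordered[whole - 1] + ordered[whole]) / 2
    # L is not whole: round up to position whole + 1 and take that value.
    return ordered[whole]


data = [3, 7, 8, 12, 13, 14, 18, 21, 24, 30]  # made-up sample, n = 10
print(kth_percentile(data, 25))  # 8     (L = 2.5, rounded up to 3)
print(kth_percentile(data, 50))  # 13.5  (L = 5, mean of the 5th and 6th values)
```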
49
What are quartiles?
Measures of location, denoted Q1, Q2, and Q3, which divide a set of data into 4 equal parts with about 25% of the values in each group.
50
Q1
separates the bottom 25% of sorted values from the top 75%
51
Q2
same as the median; separates the bottom 50% of sorted values from the top 50%
52
Q3
separates the bottom 75% of sorted values from the top 25%
53
What is the interquartile range (IQR)?
Q3-Q1
54
What is the semi-interquartile range?
(Q3-Q1)/2
55
What is the midquartile?
(Q3+Q1)/2
56
How is the 10-90 percentile range determined?
P90-P10
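The four quartile- and percentile-based statistics above can be computed together. This sketch assumes Q1 = P25 and Q3 = P75 and reuses the locator method for percentiles (L = (k/100) * n). The helper names and data are illustrative.

```python
def kth_percentile(values, k):
    """Pk via the locator L = (k/100) * n, for whole-number k from 1 to 99."""
    ordered = sorted(values)
    whole, remainder = divmod(k * len(ordered), 100)
    if remainder == 0:
        # L is whole: mean of the Lth value and the next value
        return (ordered[whole - 1] + ordered[whole]) / 2
    # L is not whole: round up and take the value at that position
    return ordered[whole]


def spread_summary(values):
    """IQR, semi-interquartile range, midquartile, and 10-90 percentile range."""
    q1 = kth_percentile(values, 25)  # assumption: Q1 = P25
    q3 = kth_percentile(values, 75)  # assumption: Q3 = P75
    return {
        "iqr": q3 - q1,
        "semi_iqr": (q3 - q1) / 2,
        "midquartile": (q3 + q1) / 2,
        "range_10_90": kth_percentile(values, 90) - kth_percentile(values, 10),
    }


data = [3, 7, 8, 12, 13, 14, 18, 21, 24, 30]  # made-up sample
print(spread_summary(data))
# {'iqr': 13, 'semi_iqr': 6.5, 'midquartile': 14.5, 'range_10_90': 22.0}
```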
57
What is a boxplot?
A graph of a data set that consists of a line extending from the minimum value to the maximum value, and a box with lines drawn at Q1, the median, and Q3.