Chapter 2 Flashcards

(65 cards)

1
Q

frequency distribution

A

records number of times each possible thing occurs during an experiment

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

histogram

A

groups adjacent values together to give a visual picture, obscuring noise while preserving important data trends, looks like bar graph

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

real lower limit

A

smallest value that would be classed as falling into the interval, like rounding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

real upper limit

A

largest value that would be classed as being in the interval, like rounding

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

midpoint

A

average of the upper and lower limit presented for convenience

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

outlier

A

extreme value that is widely separated from the rest of the data, frequently representing errors in recording data (but not always)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

normal curve

A

bell-shaped curve that is symmetrical around the center of the distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

kernel density plot

A

pays no attention to mean and standard deviation, instead holds to the idea that each observation might have been slightly different

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

stem-and-leaf display

A

Tukey, exploratory data analysis, helpful for comparing 2 different distributions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

leading digits

A

most significant digits, form the stem (vertical axis) of the stem-and-leaf display

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

stem

A

vertical axis of the stem-and-leaf display, formed by the leading digits/most significant digits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

trailing digits

A

less significant digits, form the leaves (horizontal elements) of the stem-and-leaf display

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

leaves

A

horizontal elements of the stem-and-leaf display, formed by the trailing digits/less significant digits

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

bimodal

A

graph having two predominant peaks instead of one (even when these peaks are not exactly the same height)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

unimodal

A

distribution having only one major peak

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

modality

A

refers to the number of major peaks in a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

negatively skewed

A

distribution with tail going out to the right (they point to the negative)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

positively skewed

A

distribution with tail going out to the left (they point toward the positive)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

skewness

A

statistical measures of the degree of asymmetry

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

kurtosis

A

the relative concentration of scores in the center, the upper and lower ends (tails) and the shoulders (between the center of the tails) of a distribution

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

mesokurtic

A

a normal distribution, with tails normally proportioned (neither too thick nor too thin) and with center normally shaped (neither too many nor too few scores concentrated there)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

platykurtic

A

flatter-shaped distribution where scores are concentrated in the shoulders (pulled in from the tails and down from the center)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

leptokurtic

A

distribution with higher-than-normal center peak and thicker-than-normal tails

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

sigma

A

standard notation for sum (adds up to)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
measures of central tendency
different statistics that measure the "center" of the distribution
26
measures of location
reflect where on the scale the distribution is centered
27
mode
Mo-most common score, advantage: represents the largest number of people & unaffected by extreme scores
28
median
Mdn-corresponds to the point at or below which 50% of the scores fall when the data are arranged in numerical order, advantage: unaffected by extreme scores
29
median location
(N+1)/2
30
mean
X bar, sum of scores divided by # of scores, disadvantage: influenced by extreme scores, value may not actually exist in the data; advantage: can be manipulated algebraically, estimates the population well
31
relation of measures of central tendency to one another
whenever the distribution is normal (unimodal and symmetric), the mean, median, and mode will all be close to one another
32
trimmed means
means calculated on data for which we have discarded a certain percentage of the data at each end of the distribution, to weaken the effects extreme scores have on the mean and to use a population estimate with a small standard of error
33
dispersion
variability around the central measure of tendency (usually around the mean)
34
range
measure of distance from lowest to highest score, relies on the extremes and so may be a distorted picture of the variability
35
interquartile range
discards upper and lower 25%ages of scores, leaving middle half to make up the range (Q3-Q1), can discard too much of the data to be good representative of a sample
36
first quartile
point that cuts off the lowest 25% of a distribution, Q1
37
third quartile
point that cuts off the upper 25% of a distribution, Q3
38
second quartile
median of a distribution, Q2
39
Winsorized sample
using trimmed samples to estimate variability, dropping a %age of the highest and lowest scores and replacing them with copies of the highest and lowest remaining scores
40
absolute value
positive expression of an integer (for example, the absolute value of -3 is 3.)
41
mean absolute deviation
turning all numbers into their absolute values (eliminating negative numbers) prior to finding the mean to determine deviation from the mean
42
standard deviation
sum of all (X-Xbar), squared divided by N-1
43
sample variance
(s squared), part of the whole population
44
population variance
(sigma squared), whole population
45
coefficient of variation
CV=(standard deviation / mean) X 100, to express the answer as a percentage; to determine which of two groups/tests is better
46
statistics
characteristics of samples, designated by Roman letters
47
parameters
characteristics of populations, designated by Greek lettes
48
population mean
symbolized by the Greek mu
49
expected value
long-range average of many samples
50
unbiased estimate
estimator whose expected value equals the parameter to be estimated
51
degrees of freedom
df: losing one degree of freedom (dividing by N-1 instead of just by N) because mu is not known and must be estimated from the sample mean
52
boxplot
aka box-and-whisker plot-method of looking at data, designed by Tukey, includes a scale that covers the whole range of obtained values, a rectangular box drawn from Q1 to Q3 with a vertical line representing the median, and lines called whiskers from the quartiles out to the adjacent values
53
quartile location
taking the 1st and 3rd quartiles, (median location +1)/2
54
inner fences
point that falls 1.5 times the interquartile range below or above the appropriate quartile
55
adjacent values
those actual values in the data that are no more extreme (no farther from the median) than the inner fences
56
deciles
like quartiles, but divide the distribution into 10ths rather than quarters
57
percentiles
divide the distribution into hundredths
58
quantiles/fractiles
dividing data into chunks for statistical purposes, like percentiles
59
linear transformations
multiplying a value by a constant and adding a constant to express the same value in a new way (like converting Celsius degrees into Fahrenheit degrees)
60
nonlinear transformations
using exponents, logarithms, trigonometric functions, etc. to transform values into another expression, usually involve a change in shape of a distribution
61
centering
subtracting sample mean from all of the observations, rendering the new mean 0.00 but not affecting the standard deviation or the variance
62
reflection
preventing subjects from simply checking the same point on the scale all the way down without thinking by reversing the phrasing of the questions (half could be positive, like "strongly agree", and half could be negative, like "strongly disagree"), accomplished by a linear transformation
63
deviation scores
employed to rescale data, subtracting mean from each observation
64
standard scores
creating deviation scores and then dividing them by the standard deviation
65
standardization
creating standard scores from raw scores