Reading Quiz 1 Flashcards

1
Q

Distribution

A

The distribution of a variable indicates what values the variable takes on and the frequency at which it takes on these values

2
Q

Key features of a histogram

A

Center, spread, shape, outliers

3
Q

Three basic shapes

A

Symmetric, skewed right, skewed left

4
Q

Shape of distribution can also be described

A

By referring to number of modes

Unimodal, bimodal, multimodal, or uniform

5
Q

Measures of center

A

Mean and median

6
Q

Sample mean

A

Arithmetic average or arithmetic mean, average of a set of data values

7
Q

Median

A

Middle number when the data are arranged in order

8
Q

Median position formula

A

Indicates where the median will lie
(n+1)/2
n = number of numbers in the data set
Formula only indicates where the median is, not what the median is
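
The position formula can be sketched in code as follows. This is a minimal illustration, not a reference implementation; the function names `median_position` and `median` are made up for this example, and positions are counted from 1 in the ordered list.

```python
def median_position(n):
    """1-based position of the median in an ordered list of n values."""
    return (n + 1) / 2


def median(values):
    """Find the median by locating its position in the ordered data."""
    ordered = sorted(values)
    pos = median_position(len(ordered))
    if pos.is_integer():
        # Odd n: the position is a whole number, so take that observation.
        return ordered[int(pos) - 1]
    # Even n: the position falls halfway between two observations,
    # so average the observations on either side of it.
    lower = ordered[int(pos) - 1]
    upper = ordered[int(pos)]
    return (lower + upper) / 2


print(median_position(5), median([5, 1, 3, 9, 7]))
print(median_position(4), median([4, 1, 3, 2]))
```

With four values the position is 2.5, which is not itself the median; it only says the median lies halfway between the 2nd and 3rd ordered values.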

9
Q

Perfectly symmetric vs skewed

A

If perfectly symmetric, mean equals median

If skewed, the mean lies farther out in the long tail than the median
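
A small check of this relationship, using Python's standard `statistics` module. Both data sets are made up for illustration.

```python
from statistics import mean, median

symmetric = [1, 2, 3, 4, 5]
right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]  # made-up data with a long right tail

# Symmetric data: mean and median agree.
print(mean(symmetric), median(symmetric))

# Right-skewed data: the mean is pulled toward the long tail,
# so it ends up larger than the median.
print(mean(right_skewed), median(right_skewed))
```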

10
Q

Measures of spread

A

Range, interquartile range, five number summary, variance and sample standard deviation

11
Q

Range

A

Largest number minus smallest number

12
Q

Interquartile range

A

Q3 - Q1

13
Q

Five number summary

A

Minimum, Q1, median, Q3, maximum

14
Q

Most commonly used measure of spread

A

Standard deviation

15
Q

Variance

A

s^2 = (Σ(xi - xbar)^2)/(n-1)

16
Q

Standard deviation

A

The square root of the variance, represented by s
Measures how the numbers are spread out from the mean
s = square root of variance formula
Nonresistant
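
The variance and standard deviation formulas can be written out directly. This is a sketch with made-up data; the function names `sample_variance` and `sample_sd` are chosen for this example, and the standard library's `statistics.variance` and `statistics.stdev` compute the same quantities.

```python
import math


def sample_variance(values):
    """s^2 = sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    xbar = sum(values) / n
    return sum((x - xbar) ** 2 for x in values) / (n - 1)


def sample_sd(values):
    """s = square root of the sample variance."""
    return math.sqrt(sample_variance(values))


data = [2, 4, 4, 4, 5, 5, 7, 9]          # made-up data, mean 5
with_extreme = data + [50]               # same data plus one extreme value

print(sample_variance(data), sample_sd(data))
# Nonresistant: a single extreme observation inflates s considerably.
print(sample_sd(with_extreme))
```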

17
Q

Deviation of xi from the mean

A

xi - xbar

Sum of all deviations from the mean equals zero
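
A quick numerical check that the deviations cancel out, using made-up data. The comparison allows for floating-point rounding, since with other data the sum may come out as a tiny nonzero number rather than exactly 0.

```python
import math

data = [1, 2, 3, 4, 10]                  # made-up data
xbar = sum(data) / len(data)             # mean of the data
deviations = [x - xbar for x in data]    # xi - xbar for each observation
total = sum(deviations)

print(deviations)
print(math.isclose(total, 0.0, abs_tol=1e-9))
```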

18
Q

Degrees of freedom

A

Quantity n - 1

Appears in the denominator of the formulas for variance and standard deviation

19
Q

Symmetric measures

A

Mean and standard deviation

20
Q

Skewed sets measures

A

Median and five number summary

21
Q

Outlier

A

An individual observation that falls outside the overall pattern of the graph
striking deviations

22
Q

Outlier test

A

Data point is outlier if it lies more than 1.5 interquartile ranges below Q1 or above Q3
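
The 1.5 × IQR test can be sketched as below. The function names `outlier_fences` and `is_outlier` and the quartile values in the example are made up for illustration; Q1 and Q3 are assumed to have been computed already.

```python
def outlier_fences(q1, q3):
    """Return the lower and upper cutoffs for the 1.5 x IQR test."""
    iqr = q3 - q1
    return q1 - 1.5 * iqr, q3 + 1.5 * iqr


def is_outlier(x, q1, q3):
    """True if x lies more than 1.5 IQRs below Q1 or above Q3."""
    low, high = outlier_fences(q1, q3)
    return x < low or x > high


# Hypothetical quartiles: Q1 = 10, Q3 = 20, so IQR = 10.
print(outlier_fences(10, 20))
print(is_outlier(36, 10, 20), is_outlier(35, 10, 20))
```

A value exactly on a cutoff is not flagged here, since the rule says "more than" 1.5 interquartile ranges.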

23
Q

two types of graphs most appropriate for categorical data

A

pie charts and bar graphs

24
Q

graph inappropriate when several percentages don’t represent portions of the same whole

A

pie chart

25
want raw data values, center, shape, spread, too many for dot plot, what graph
stemplot
26
histogram
breaks the range of values of a variable into classes and displays only the count or percent of the observations that fall into each class
most common graph of the distribution of a quantitative variable
27
ogive
relative cumulative frequency graph
horizontal axis: values of variable
vertical axis: relative cumulative frequency
28
how to find center of ogive
draw a horizontal line from 50% on the vertical axis to the graph; the value on the horizontal axis at that point is the center
29
time plot axes
time is on horizontal axis
30
trend
on time plot, overall upward or downward slope
31
seasonal variation
time plot, shorter-term, regularly occurring, rise and fall variations
32
resistant measure
measure of center or spread that is relatively unaffected by extreme observations
33
two resistant measures
median and interquartile range
34
first quartile
the median of the subset of observations whose position in the ordered list is to the left of the overall median
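
The quartile definition above can be sketched as follows, with Q3 assumed to be defined symmetrically as the median of the observations to the right of the overall median. The function name `five_number_summary` is made up for this example. Statistical software often uses other quartile conventions, so its results may differ slightly.

```python
from statistics import median


def five_number_summary(values):
    """Minimum, Q1, median, Q3, maximum (needs at least two observations)."""
    ordered = sorted(values)
    n = len(ordered)
    half = n // 2
    # Observations to the left of the overall median. When n is odd,
    # the median itself is excluded from both halves.
    lower = ordered[:half]
    # Observations to the right of the overall median.
    upper = ordered[half + (n % 2):]
    return (ordered[0], median(lower), median(ordered), median(upper), ordered[-1])


print(five_number_summary([1, 2, 3, 4, 5, 6, 7]))
print(five_number_summary([1, 2, 3, 4, 5, 6, 7, 8]))
```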
35
graph that gives picture of five number summary
boxplot
36
IQR
Q3-Q1
37
difference between regular and modified boxplot
regular is graph of five number summary | modified plots suspected outliers individually
38
measures of spread
standard deviation and IQR
39
when is standard deviation 0
when there is no spread aka all observations are the same value
40
adding the same number a to each observation
adds a to measures of center and to quartiles but does not change measures of spread
41
multiplying each observation by the same number b
multiplies both measures of center (mean and median) and measures of spread (IQR and standard deviation) by b (by |b| for the measures of spread when b is negative)
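
The effect of adding a constant and of multiplying by a constant can be checked numerically. This is a sketch with made-up data and made-up constants `a = 10` and `b = 3`, using the standard `statistics` module.

```python
import math
from statistics import mean, median, stdev

data = [2, 4, 4, 4, 5, 5, 7, 9]   # made-up data
a, b = 10, 3                      # made-up constants

shifted = [x + a for x in data]   # add a to every observation
scaled = [x * b for x in data]    # multiply every observation by b

# Adding a: center moves by a, spread is unchanged.
print(mean(shifted) - mean(data), median(shifted) - median(data))
print(math.isclose(stdev(shifted), stdev(data)))

# Multiplying by b: center and spread are both multiplied by b.
print(mean(scaled) / mean(data), median(scaled) / median(data))
print(math.isclose(stdev(scaled), b * stdev(data)))
```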
42
three graphical methods for comparing distributions
bar charts, back to back stemplots, and side by side boxplots
43
categorical variables
place individuals into groups or categories (qualitative)
44
quantitative variables
numeric measures, makes sense to perform arithmetic operations such as adding or averaging
45
most appropriate displays of categorical data
pie charts, dot plots, bar graphs
46
best displays for quantitative data
dot plots, stem plots, histograms
47
bins
the classes (piles of values) in a histogram; they need to be physically and numerically equal in width
48
rule of thumb bin number
square root of number of observations
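
The rule of thumb, together with equal-width bins, can be sketched like this. The function names are made up for this example, and rounding the square root to the nearest whole number is an assumption, since the card does not say how to round.

```python
import math


def rule_of_thumb_bins(n):
    """Suggested number of bins: about the square root of n.

    Rounding to the nearest whole number is an assumption of this sketch.
    """
    return max(1, round(math.sqrt(n)))


def equal_width_edges(values, k):
    """Edges of k bins of equal numeric width spanning the data."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / k
    return [lo + i * width for i in range(k + 1)]


print(rule_of_thumb_bins(100), rule_of_thumb_bins(50))
print(equal_width_edges([0, 10], 5))
```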
49
spread
level of variability; the range is also a measure of this
50
standard deviation
measure of average distance of all observations from the mean
51
box plots
not ideal indicators of shape and should not be used if there are other options