Chapter 2 Flashcards

(38 cards)

1
Q

Data

A

values of a variable

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What are the 2 types of data?

A

qualitative and quantitative

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the 2 types of quantitative data?

A

discrete

continuous

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Time series data

A

ordered data values observed over time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Cross section data

A

data values observed at a fixed point in time

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Classes

A

categories used for grouping data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Frequency

A

the number of observations that fall in a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Frequency distribution

A

a listing of all classes and their frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

Relative frequency

A

the ratio of the frequency of a class to the total number of observations

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

Relative-frequency distribution

A

a listing of all classes and their relative frequencies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Lower cutpoint

A

the smallest value that could go in a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Upper cutpoint

A

the smallest value that could go in the next higher class (equivalent to the lower cutpoint of the next higher class)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Midpoint

A

the middle of a class, found by averaging its cutpoints

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Width

A

the difference between the cutpoints of a class

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is a frequency distribution?

A

a list or table containing the values of a variable (or set of ranges within which the data fall) and the corresponding frequencies with which each value occurs (or frequencies with which data fall within each range)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Why do we use frequency distributions?

A

it summarizes data by condensing the raw data in a more useful form and allows for a quick visual interpretation of the data

17
Q

Discrete data

A

possible values are countable

18
Q

Continuous data

A

may take on any value in some interval

19
Q

What kind of data uses bar and pie charts?

A

qualitative (category) data

20
Q

Why is it useful to use a stem and leaf diagram?

A

it is a simple way to see distribution details from qualitative data

21
Q

Line charts are used to show…

A

the values of one variable vs TIME

22
Q

Scatter diagrams shows points for…

A

bivariate data

- one value measured on the vertical axis and the other measured on the horizontal axis

23
Q

Mean

A

the arithmetic average of data values

- sum of the values divided by the number of values

24
Q

Median

A

in an ordered array, it is the “middle” number that splits the distribution in half

25
Is the median affected by extreme values? Mode?
NO, NO
26
Mode
a measure of location the value that occurs most often used for numerical or categorical data there may be no mode or there may be serval modes
27
Distribution of a data set
a table, graph, or formula that provides the values of the observations and how often they occur
28
Which measure of location is the "best"?
Usually the mean unless there are extreme values (outliers) then the median is used
29
What does (n) represent? What about (N)?
sample | population
30
What do you do first if you see there is an outlier?
check for human error
31
Variation
gives information on the spread or variability of the data values
32
Range
simplest measure of variation | difference between the largest and smallest observations
33
What are some disadvantages of the range?
it ignores the way in which data are distributed | it's sensitive to outliers
34
What are the benefits of the interquartile range? Equation?
it can eliminate the outlier problems | IQR = 3rd - 1st
35
Degrees of freedom
the number of observations that are free to vary after sample mean has been calculated
36
What is the most commonly used measure of variation?
Standard deviation
37
What must the sum of deviations equal?
0!!!!
38
Coefficient of variation
measures relative variation | always in percentage