Chapter 3 Flashcards

(49 cards)

1
Q

It is the practice or science of collecting and analyzing numerical data in large quantities especially for the purpose of inferring proportions in a hole from those in a representative sample

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

It is used in almost all analysis in order to derive insights like projections and how different sets of data affect each other

A

Statistics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

It is a central or typical value for a probability distribution

A

Measures of central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

It may also be called a center or location of the distribution

A

Measures of central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Central tendency often called

A

Averages

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Measures of central tendency

A

mean
median
mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

It can be calculated for either a finite set of values or for a theoretical distribution such as the normal distribution

A

Middle tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

The tendency of quantitative data to cluster around some central value

A

Central tendency

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

It is the sum of all measurements divided by the number of observations

A

mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

An average of the data

A

Mean

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

It is the midpoint of our data that separates the upper and lower half of the data set

A

Median

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

These are the only measures of central tendency that can be used for ordinal data, in which values are ranked relative to each other but are not measured absolutely

A

Median and mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

The most frequent value in the data set

A

Mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

The only central tendency measure that can be used with nominal data which have purely qualitative category assignments

A

mode

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Also called the variability, scatter, spread

A

Measures of dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

It is the extent to which a distribution is stretch or squeezed

A

Dispersion

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

Measures of dispersion

A

range
mad
variance
standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

The difference between the smallest and largest data point in the set

A

range

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

It is the average of the absolute deviations from a central point

A

Mean absolute deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

It is a summary statistic of statistical dispersion or variability

A

Mean absolute deviation

21
Q

In simpler terms it means how far are the data points from the mean

A

Mean absolute deviation

22
Q

This is another way of measuring the spread between numbers in a data set

23
Q

It measures how far its number in the set is from the mean

24
Q

It is simply the square root of the variance

A

Standard deviation

25
It is the most commonly used measure to express dispersion
Standard deviation
26
Are cut points dividing the range of a probability distribution into continuous intervals with equal probability or dividing the observations in a sample in the same way
Quantiles
27
Quantiles
quartiles deciles percentiles
28
Divide or cut the data into four parts
Quartiles
29
Are often used as a measure of spread of the data in what is called interquartile range (IQR)
Quartiles
30
The difference between the third quartile and the first quartile
Interquartile range
31
Is the median of the first half of the data set and marks the point at which 25% of the data values are lower and 75% are higher
First quartile
32
Is the median of the second half of the data set and marks the point at which 25% of the data values are higher and 75% lower
Third quartile
33
cut into 10
deciles
34
divide into 1% segments
percentiles
35
are used for larger data sets
deciles and percentiles
36
It is a way of standardizing scores on the same scale by dividing a scores deviation by the standard deviation in a data set
Standard score (z-score)
37
It measures the number of standard deviations a given data point is from the mean
Standard score
38
What it means if it is a negative z score
value is less than the mean
39
What it means if it is a positive z score
value is greater than the mean
40
It represents the ratio of the standard deviation to the mean and it is a useful statistic for comparing the degree of variation from one data series to another even if the means are drastically different from one another
Coefficient of variation
41
It allows investor to determine how much volatility or risk is assumed in comparison to the amount of return expected from investments
Coefficient of variation
42
Refers to distortion or a symmetry in a symmetrical bell curve or normal distribution in a set of data
Skewness
43
What it means if it is a positive skew
The mean is greater than the median
44
What it means if it is a negative skew
The mean is less than the median
45
It is a statistical measure that expresses the extent to which two variables are linearly related
Correlation
46
It is a common tool for describing simple relationships without making a statement about cause and effect
Correlation
47
Correlation coefficient of positive one indicates a
perfect positive correlation or direct relationship
48
Correlation coefficient of negative 1 indicates a
Perfect negative correlation or inverse relationships
49
Correlation coefficient near zero indicates a
No correlation