Reading 7 Statistical Concepts and Market Returns Flashcards

1
Q

Explain what kurtosis measures, and differentiate between a leptokurtic distribution, a platykurtic distribution, and a mesokurtic distribution.

A

Kurtosis measures a distribution peak compared to the normal distribution kurtosis of 3.

Leptokurtic: more peaked, fatter tails, kurtosis greater than three (excess kurtosis > 0).

Platykurtic: less peaked, thinner tails, kurtosis less than 3 (excess kurtosis < 0).

A mesokurtic (normal) distribution has kurtosis of 3 (excess kurtosis of 0).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Give the formula used to compare the relative dispersion of returns on the two investments using the coefficient of variation.

A

CV is the ratio of standard deviation of the data set to its mean

CV = s / X
¯¯¯

where:

s = sample standard deviation

¯¯¯
X = the sample mean

Note: CV measures the risk per unit of return.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Explain Chebyshev’s inequality, and describe the advantage of using it.

A

A method of approximating the minimum proportion of observations that lie within k standard deviations from the mean of a data set.

The advantage of using Chebyshev’s inequality is that it holds for samples and populations as well as for discrete and continuous data, regardless of the shape of the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Describe when arithmetic mean and geometric mean should be used.

A

Arithmetic mean – single period; arithmetic mean is the average of one-period returns.

Geometric mean – multiple periods; geometric mean links investment performance over time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

How is the weighted mean calculated?

A

Where an arithmetic mean assigns equal weight to each observation in the data set, a weighted mean assigns different weights to different observations. (Example: the effect of asset class weight on portfolio mean return.)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

List the properties of the arithmetic mean, and identify a potential problem with it.

A

All observations are used in the computation of the arithmetic mean, and a data set has only one arithmetic mean.

All interval and ratio data sets have an arithmetic mean.

The sum of the deviations from the arithmetic mean is always 0.

A potential problem with the arithmetic mean occurs with extreme high or low values, which can disproportionately alter it.

Note: The arithmetic mean is the most frequently used measure of central tendency.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Give the formula used to calculate sample skewness.

A

SK=[n(n−1)(
n−2)]∑ni=1(Xi−––X)3s3

n = number of observations in the sample

s = sample standard deviation

Note: when a distribution is nonsymmetrical, it is considered skewed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Give the formula for the position of a percentile in a data set with n observations sorted in ascending order.

A

Ly=y 100 (n+1)

where:

y = percentage point at which we are dividing the distribution

Ly = location (L) of the percentile (Py) in the data set sorted in ascending order

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is the harmonic mean used, and what is the formula used to calculate it?

A

To determine the average cost of shares in dollar cost averaging. The dollar amount periodically invested can be divided by the share price to determine the number of shares at each purchase. The total amount invested is then divided by the total number of shares to determine the average cost of shares.

XH=¯n∑ni=1(1/Xi)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How are the sample and population variance calculated?

A

σ2=n∑i=1(Xi−––X)2

where:

Xi = observation i; μ = population mean; and N = size of the population

s2=∑ni=1(Xi−––X)2n−1

where:

n = sample size;
¯¯¯
X, the sample mean, is used in place of μ

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

Define mean absolute deviation (MAD) and give the calculation.

A

The mean absolute deviation is the average of the absolute values of deviations of observations from the mean.

MAD
=∑ni=1∣∣
∣∣Xi−––X∣∣∣∣n

where:

n = number of items in the data set

¯¯¯
X
=
the arithmetic mean of a sample

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

Identify the important relationships between the arithmetic mean and geometric mean of a data set. And identify when a geometric mean is frequently used.

A

The geometric mean is always less than or equal to the arithmetic mean.

The geometric mean equals the arithmetic mean only when all the observations are identical.

The difference between the geometric and arithmetic mean increases as the dispersion of observed values increases.

A geometric mean is frequently used when calculating average rates of return over multiple periods or to compute the growth rate of a variable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Describe nominal, ordinal, interval, and ratio scales.

A

Nominal: weakest level of measurement; categorize or count data but do not rank them

Ordinal: stronger level of measurement than nominal scales; sort data in categories that are ranked according to a certain characteristic

Interval: rank observations such that the differences between scale values are equal so that values can be added and subtracted meaningfully

Ratio: have all the characteristics of interval scales and true zero point as the origin; strongest measurement level

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Define median and mode.

A

The median is the value of the middle item of a data set once it has been arranged in ascending or descending order.

The mode is used to identify the most frequently occurring value of a data set.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Distinguished between a return distribution with positive skewness and one with negative skewness.

A

A return distribution with positive skewness has frequent small losses and few large gains, and rmode < rmedian < rmean.

A return distribution with negative skewness has frequent small gains and few large losses, and rmean < rmedian < rmode.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Explain why the geometric mean is more appropriate for reporting historical returns.

A

Because it equals the rate of growth an investor would have to earn each year to match actual cumulative investment performance. The geometric mean is an excellent measure of past performance.

17
Q

Describe relative frequency for an interval.

A

The proportion or fraction of total observations in an interval. Each interval’s relative frequency is calculated by dividing its absolute frequency by the total number of observations.

18
Q

What is a frequency distribution?

A

A tabular illustration of data categorized into a small number of intervals or classes. Each observation must fall into only one interval (mutually exclusive), and the set of intervals must cover the entire range of values represented in the data (all-inclusive). The data presented in a frequency distribution may be measured using any type of measurement scale.

19
Q

When is a distribution symmetrical?

A

When the mean, median, and mode are equal.

20
Q

Give the formula used to calculate the Sharpe ratio, and identify what the ratio measures.

A

S=–––rp−rfsp

where:

¯
rp = mean portfolio return

rf = risk-free return

sp = standard deviation of portfolio returns

The Sharpe ratio measures the excess return per unit of risk.

21
Q

Give the formula used to calculate sample excess kurtosis.

A
K
E=
⎛⎜
⎜
⎜
⎝n(n+1)(n−1)(n−2)(n−3)∑ni=1(Xi−––X)4s4
⎞
⎟
⎟
⎟
⎠−3(n−1)2(n−2)(n−3)

Kurtosis itself is the first term.

22
Q

In what way is a histogram used, and what is the advantage of using it?

A

A histogram graphically represents the data contained in a frequency distribution.

The advantage of a histogram is that users can quickly recognize where most of the data is concentrated.