Sampling and Estimation Flashcards

1
Q

What is simple random sampling?

A

A method of selecting a sample in such a way that each item or person in the population being studied has the same probability of being included in the sample.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a sampling distribution?

A

The distribution of all values that a sample statistic can take on when computed from samples of identical size randomly drawn from the same population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is sampling error?

A
  1. The difference between a sample statistic and its corresponding population parameter.
  2. Using sample data presents the risk that results found in an analysis do not represent the results that would be obtained from using data involving the entire population from which the sample was derived.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is stratified random sampling?

A
  1. Involves randomly selecting samples proportionally from subgroups that are formed based on one or more distinguishing characteristics, so that the sample will have the same distribution of these characteristics as the overall population.
  2. Ex: Stratification based on age groups of population
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is time series data?

A

Time-series data consists of observations taken at specific and equally spaced points in time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is cross sectional data?

A

Cross-sectional data consists of observations taken at a single point in time.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What is the central limit theorem?

A

States that for a population with a mean μ and a finite variance σ2, the sampling distribution of the sample mean of all possible samples of size n (for n ≥ 30) will be approximately normally distributed with a mean equal to μ and a variance equal to σ2 / n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the standard error of the sample mean?

A

The standard deviation of the distribution of the sample means.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the calculation for the standard error of the sample mean, when the the population standard deviation is known?

A

where σ, the population standard deviation

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the calculation for the standard error of the sample mean, when the the population standard deviation is unknown?

A

Where s, the sample standard deviation, is used because the population standard deviation is unknown

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the three desirable properties of an estimator?

A
  1. Unbiasedness (sign of estimation error is random),
  2. Efficiency (lower sampling error than any other unbiased estimator), and
  3. Consistency (variance of sampling error decreases with sample size).
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is a point estimate?

A
  1. Point estimates are single value estimates of population parameters.
  2. Ex: the sample mean is essentially a point estimate of a population mean.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is an estimator?

A
  1. An estimator is a formula used to compute a point estimate.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is a confidence interval?

A

Ranges of values, within which the actual value of the parameter will lie with a given probability.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the calculation for the confidence interval?

A

confidence interval = point estimate ± (reliability factor × standard error)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the reliability factor?

A

A number that depends on the sampling distribution of the point estimate and the probability that the point estimate falls in the confidence interval.

17
Q

Describe the properties of Student’s t-distribution:

A
  1. Similar, but not identical, to the normal distribution in shape.
  2. Defined by the degrees of freedom
  3. Has fatter tails compared to the normal distribution.
18
Q

What is the calculation for Student’s t-distribution’s degress of freedom?

A

n − 1

19
Q

Does a greater or lesser degree of freedom(df) make the Student’s t-distribution more closely resemble a normal distribution?

A
  1. Student’s t–distribution is closer to the normal distribution when df is greater.
20
Q

What effect does a greater degree of freedom have on the confidence interval in student’s t-distribution?

A

Confidence intervals are narrower when df is greater.

21
Q

For a normally distributed population, when can a z-statistic be used to construct a confidence interval for it’s mean?

A
  1. When variance is known
  2. Acceptable in the case of a normal population with an unknown variance if the sample size is large (30+).
22
Q

For a normally distributed population, when can a t-statistic be used to construct a confidence interval for it’s mean?

A

When the variance is unknown

23
Q

What effect does increasing sample size have on parameter estimates and confidence intervals?

A
  1. Generally improve parameter estimates
  2. Narrow confidence intervals.
24
Q

What are the five types of sampling method bias?

A
  1. Data mining (significant relationships that have occurred by chance),
  2. Sample selection bias (selection is non-random),
  3. Look-ahead bias (basing the test at a point in time on data not available at that time),
  4. Survivorship bias (using only surviving mutual funds, hedge funds, etc.), and
  5. Time-period bias (the relation does not hold over other time periods).