Lecture 7 Flashcards

1
Q

What are Computer Intensive Techniques (resampling/CIT)?

A

use of computers to compute thousands of new samples, divergent statistics or other values of interest to analyze and to validate models.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a Parametric assumption?

A

based on assumptions regarding the population and the parameter of interest. A normal distribution is expected. If the expectations are not achieved, we switch to non-parametric statistics.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is a non-parametric assumption?

A

tests make no strong assumptions about the population, sometimes called “distribution free”.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the 3 types of CIT?

A
  1. Bootstrap sampling distributions and confidence intervals.
  2. Monte Carlo methods.
  3. Randomization tests, permutation tests, exact tests.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Are CIT approaches essentially parametric or non-parametric?

A

Non-parametric.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is Bootstrapping?

A

It is “resampling with replacement” from one sample with size n.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What can we do after Bootstrapping?

A

Compute any statistic (mean, median, standard dev. etc.) and store this value. This process is repeated 1000 times.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What can we do after computing the statistics (Bootstrapping)?

A

With the results, a distribution is constructed, and this is compared to the initial statistic drawn from the original sample. Statistical signigicance of the original is being established.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What are 4 advantages of Bootstrapping?

A
  1. Simple test, especially with a small sample size.
  2. It allows simulating the sampling distribution of many different statistics.
  3. Useful when the variables are not normally distributed.
  4. Useful when the relationship between variables is not linear.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What are 2 disadvantages of Bootstrapping?

A
  1. The new sampling distribution of the mean is NOT equal to the real sampling distribution of the mean.
  2. The sample must be representative for the population.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What is the Monte Carlo method?

A

Method to calculate the area of (usually) more complex areas.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How does the Monte Carlo method work?

A

Randomly throw 1000 arrows in a known area positioned around the area you want to calculate. The chance of being “hit” is the same for every spot. if 50% of the arrows hit the area you want to calculate, you know that the surface area is 50% of the known area.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What are 4 advantages/characteristics of the Monte Carlo method?

A
  1. It can solve problems that have a probabilistic interpretation.
  2. When the data set does not meet the requirements for parametric or asymptotic methods, MC can be used.
  3. It involves repeatedly sampling in order to obtain a good estimate or approximation of the exact p-value.
  4. Computing an exact p-value is possible via exact tests and randomization tests, but only for small data sets.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the order for statistical testing (MC method)? (7 steps)

A
  1. State null hypothesis and alternative hypothesis.
  2. State significance level: alpha.
  3. Compute degrees of freedom.
  4. Compute decision rule, critical region, critical values.
  5. Compute test statistic and confidence interval.
  6. State the results.
  7. State the conclusion.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is given in asymptotic methods?

A

A sufficient sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is the one-sample T-test formula?

A

t = (Sample mean - Proposed constant for the population mean) / (Sample standard deviation / squared root of sample size)

17
Q

What is the Z-test formula?

A

Z = (Sample mean - population mean) / (population standard deviation / squared root of sample size)