Week Five Flashcards

(29 cards)

0
Q

What do inductive statistics provide?

A

The means to estimate population parameters from sample statistics and test hypotheses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
1
Q

What are the two golden rules of quants?

A
  1. Correlation does not imply causation.

2. r is not the line of best fit.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

At what level of measurement do we use parametric tests?

A

The continuous level - interval or ratio.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Parametric tests can only be used when?

A

The data is normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is the rule of thumb about big enough sample sizes?

A

The sample must be greater than 30 to be deemed large enough.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Define the sample distribution of the sample mean

A

This is the distribution of sample statistics that would be obtained if a large number of random samples of a given size were drawn from a given population.
It is a hypothetical distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

At what level of measurement do we use non-parametric tests?

A

They can be used with nominal or ordinal level data.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Non-parametric tests can be used when?

A

We have a small sample and when we don’t know anything about the underlying population.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the Central Limit Theorem (CLT)?

A

The theorem states that the sampling distribution of the sample mean approaches a normal distribution as the sample size grows.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What must happen for the CLT to apply?

A

There must be at least 10 cases of an event happening and of the event not happening.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What do we use the CLT for?

A

The CLT mean that we are able to use the sample statistics to estimate the value of population parameters, and indicate how accurate the estimates are.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the four fundamental propositions of the CLT?

A
  1. Regardless of the distribution of the population, the sampling distribution of the sample mean will be approximately normal if the sample size is big enough.
  2. The mean of the sampling distribution of the sample means will be equal to the unknown population.
  3. The standard deviation of the sampling distribution indicates the range of possible error.
    The standard deviation of the sample is the best estimate of the population standard deviation.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What is the equation for standard error?

A

Standard error = population SD ➗ the square root of the number of cases.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How do we reduce standard error?

A

Increase the sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Explain confidence intervals.

A

If we known the sample mean we can fix a confidence interval around it to give us a interval estimate, which will predict the parameters the true population mean will fall in.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the standard confidence interval? And what does it mean?

A

95% confidence interval.

The 95% confidence interval means that we expect to be correct that the interval will include the population mean 95% of the time.

16
Q

How can we increase confidence in interval estimates?

A

If you want to be more confident that the interval estimate includes the population mean we can choose a wider confidence interval, at the cost of being less precise. E.g. 99% confidence interval.

17
Q

How can we increase precision of interval estimates?

A

If you want to be more precise in the estimate of the population mean then you can choose a narrower confidence interval, at the cost of being lest confident. E.g. 90% confidence interval.

18
Q

What do we need to find the confidence interval?

A

The population SD and the z-value.

19
Q

How do you calculate confidence intervals?

A

CI = mean + or - z-score x sampling error.

20
Q

How do we calculate the alpha?

A

Alpha = 1 - the confidence interval.

21
Q

When do we use the t-distribution?

A

When dealing with small samples and where we do not know the population SD.

22
Q

For the same level of alpha the t-distribution uses?

A

A slightly wider confidence interval.

23
Q

What does the t-distribution look like?

A

It is symmetrical and bell-shaped.

24
As the sample size increases the t-distribution will?
Approximate the normal distribution.
25
What is type 1 error?
We make a Type 1 error when we reject a true null. Where we say there is a relationship when there isn't.
26
What is a type 2 error?
We make a type 2 error when we reject a true hypothesis. When we say there is not relationship when there is one.
27
What does the alpha level tell us?
The probability of making a type 1 error. When the p-value is greater than the alpha we accept the null.
28
What does the beta level tell us?
The probability of making a type 2 error and the power of the test. (1 - b)