hypothesis testing Flashcards

1
Q

What are the types of hypotheses?

A

Research hypothesis (the question being investigated)

Null hypothesis (𝐻0): The hypothesis that is tested

Alternative hypothesis (𝐻1 or 𝐻𝐴): The opposite of 𝐻0

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is a confidence interval?

A

An interval (lower & upper limit) within which the true value of a population parameter lies with a specified confidence level.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How do hypothesis tests differ from confidence intervals?

A

Hypothesis tests assess whether a single value is the true parameter.

Confidence intervals estimate a range where the true parameter likely falls.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What are the steps in a hypothesis test?

A
  1. Specify the hypothesis
  2. Obtain a test statistic from the data
  3. Compare the test statistic to a reference distribution
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is the null hypothesis (𝐻0)?

A

The hypothesis being tested, the test determines how much evidence the data provides to support this hypothesis.

Example:
𝐻0: 𝜇 = 3 (Mean density is 3 birds/km²)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is the alternative hypothesis (𝐻1 or 𝐻𝐴)?

A

The hypothesis that contradicts 𝐻0, suggesting a difference or effect.

Example:
𝐻1: 𝜇 ≠ 3 (Mean density is not 3 birds/km²)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What are one-tailed and two-tailed tests?

A

One-tailed: Tests for a directional effect (e.g., 𝐻1: 𝜇 < 3 or 𝐻1: 𝜇 > 3)

when an effect can only occur in one direction

an effect can occur in both directions but only one direction is of interest.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is two-tailed tests?

A

Two-tailed: Tests for any difference (e.g., 𝐻1: 𝜇 ≠ 3)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What do we compare the test statistic to?

A

A reference distribution (e.g., t-distribution) to determine if the observed difference is significant.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How does variability affect hypothesis testing?

A

Low variability → Easier to detect a true difference

High variability → Harder to conclude significant differences

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What types of statistical tests are covered?

A

One-sample & two-sample t-tests

ANOVA (more than two groups)
z-tests (proportions)
Chi-square tests (categorical data)
Linear regression (t and F tests)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is a t-statistic calculated in a one-sample t-test?

A

tstat = (data estimate - hypothesised value) / SE(data estimate)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

what will the tstat be if H0 is true

A

the test statistic (𝑡𝑠𝑡𝑎𝑡
) will be small (dependent on sampling variability) because the difference between the data-estimate (sample mean) and the hypothesised value is small.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

what will the tstat be if H0 is false

A

the test statistic will be large (dependent on sampling variability) because the difference between the data-estimate and the hypothesised value is large.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What distribution is typically used as the reference distribution in these examples?

A

The t-distribution is used as the reference distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What do the degrees of freedom (df) for the t-distribution depend on?

A

The degrees of freedom depend on what is being tested.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What are the two ways the reference distribution helps determine the strength of evidence for the null hypothesis?

A
  1. By obtaining an exact probability for the test statistic.
  2. By comparing the test statistic to a critical value based on a predetermined significance level.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

In a one-sample two-tailed test, what is the null hypothesis (H0) and alternative hypothesis (H1)?

A

-𝐻0:𝜇=3.6

𝐻1:𝜇≠3.6

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the reference distribution for the test with
𝑛=16?

A

𝑡𝑑𝑓=𝑛−1=𝑡15

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

In a two-tailed test, how is the area in the two tails interpreted?

A

The area in the tails represents the probability of obtaining a test statistic as extreme or more extreme than the observed value.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

How do you calculate the area in the two tails for the test statistic −0.753?

A

Add the area in the left tail (< -0.753) and the right tail (> 0.753): 0.226+0.226=0.452

22
Q

What is a p-value in hypothesis testing?

A

The p-value is the probability of observing a test statistic as extreme, or more extreme, than the one observed, assuming the null hypothesis is true.

23
Q

What does the p-value quantify in hypothesis testing?

A

The p-value quantifies the chance of observing the data (or something more extreme) if the null hypothesis (H0) is true.

24
Q

When is the null hypothesis typically rejected based on the p-value?

A

The null hypothesis (𝐻0) is usually rejected when the p-value is very small.

25
What are some common threshold values (significance levels) for p-values?
0.10 → No evidence against 𝐻0 0.05 → Weak evidence against 𝐻0 0.01 → Some evidence against 𝐻0 0.001 → Strong evidence against 𝐻0
26
What does a large p-value indicate?
The test statistic is likely under 𝐻0 We fail to reject 𝐻0
27
What does a small p-value indicate?
The test statistic is unlikely under 𝐻0 We reject 𝐻0 in favor of 𝐻1
28
What does it mean to "fail to reject the null hypothesis"?
It means that the p-value is large, indicating that the test statistic is likely to occur if the null hypothesis is true. Therefore, there is no strong evidence against 𝐻0
29
What does it mean to "reject the null hypothesis"?
It means the p-value is small, indicating that the test statistic is very unlikely to occur if the null hypothesis is true. This provides evidence in favor of the alternative hypothesis (𝐻1)
30
What is the relationship between a large p-value and the test statistic?
A large p-value suggests that the test statistic is likely to occur under the null hypothesis, providing no strong evidence to reject 𝐻0
31
What is Fixed Level Significance Testing in hypothesis testing?
Fixed Level Significance Testing involves comparing the test statistic to a critical value based on a fixed significance level (e.g., 0.05, 0.1) to decide whether to reject the null hypothesis.
32
What are statistical tables used for in significance testing?
Statistical tables provide critical values (quantiles) for different reference distributions (like the t-distribution) at various significance levels.
33
What does the critical value represent in significance testing?
The critical value is the threshold beyond which the test statistic would be considered extreme enough to reject the null hypothesis.
34
In a two-tailed test with a 5% significance level, how is the significance level distributed in the tails?
2.5% in the lower tail 2.5% in the upper tail
35
What is a two-sample t-test used for?
A two-sample t-test is used to compare the means of two groups to test if there is a statistically significant difference between them.
36
What are the null and alternative hypotheses in a two-sample t-test?
𝐻0:𝜇𝐴−𝜇𝐵=0 (no difference between means) 𝐻1:𝜇𝐴−𝜇𝐵≠0 (a difference exists between means)
37
What formula is used to calculate the test statistic in a two-sample t-test?
tstat = (μ^A-μ^B)−0) / SE(μ^A−μ^B)
38
What happens if the test statistic is less extreme than the critical value in a fixed-level test?
The null hypothesis is not rejected, indicating that the observed data does not provide strong evidence against the null hypothesis.
39
Why do we divide the significance level equally between two tails in a two-tailed test?
Because we are testing for a difference in either direction (greater or smaller), so the probability of an extreme result must be shared between both tails.
40
What is the conservative assumption made about variance in the two-sample t-test?
It is assumed that the variances are unequal, making the test more conservative.
41
What is a paired t-test used for?
A paired t-test is used to compare two dependent groups where observations in one sample are paired with observations in the other sample (e.g., before and after treatment).
42
What is the formula for the paired t-test statistic?
tstat = (μ^d−0)/ SE(μ^d)
43
How is the standard error for the paired t-test calculated?
SE(μ^d)=SDd / sqrt(n) μ^d = Mean of the differences 𝑆𝐷𝑑 = Standard deviation of the differences 𝑛 = Sample size
44
What are the key assumptions for t-tests?
Independence of data within and between groups Normal distribution of data (assess via histograms or Shapiro-Wilk test) t-tests are robust to non-normal data if sample sizes are similar.
45
What should you do if normality is unreasonable in a dataset?
Use non-parametric tests, such as the Mann-Whitney-Wilcoxon test.
46
What does the Mann-Whitney-Wilcoxon test compare?
It compares the ranks of two groups to test if their distributions are the same or different.
47
How is the Mann-Whitney U test statistic calculated?
U=W− (n(n+1))/2 ​ W = Sum of the ranks 𝑛 = Sample size
48
What does a large p-value in the Mann-Whitney test indicate?
A large p-value suggests no evidence of a difference between the two groups.
49
How can you assess the practical significance of a result?
Present the effect size and confidence interval to provide context beyond just statistical significance.
50
What are the key steps in hypothesis testing?
Specify null and alternative hypotheses Calculate a test statistic Compare to a reference distribution Use data to determine the strength of evidence for the null hypothesis