hypothesis testing Flashcards by Anita Ferrari

What are the types of hypotheses?

Research hypothesis (the question being investigated)

Null hypothesis (𝐻0): The hypothesis that is tested

Alternative hypothesis (𝐻1 or 𝐻𝐴): The opposite of 𝐻0

How well did you know this?

Not at all

Perfectly

What is a confidence interval?

An interval (lower & upper limit) within which the true value of a population parameter lies with a specified confidence level.

How well did you know this?

Not at all

Perfectly

How do hypothesis tests differ from confidence intervals?

Hypothesis tests assess whether a single value is the true parameter.

Confidence intervals estimate a range where the true parameter likely falls.

How well did you know this?

Not at all

Perfectly

What are the steps in a hypothesis test?

Specify the hypothesis
Obtain a test statistic from the data
Compare the test statistic to a reference distribution

How well did you know this?

Not at all

Perfectly

What is the null hypothesis (𝐻0)?

The hypothesis being tested, the test determines how much evidence the data provides to support this hypothesis.

Example:
𝐻0: 𝜇 = 3 (Mean density is 3 birds/km²)

How well did you know this?

Not at all

Perfectly

What is the alternative hypothesis (𝐻1 or 𝐻𝐴)?

The hypothesis that contradicts 𝐻0, suggesting a difference or effect.

Example:
𝐻1: 𝜇 ≠ 3 (Mean density is not 3 birds/km²)

How well did you know this?

Not at all

Perfectly

What are one-tailed and two-tailed tests?

One-tailed: Tests for a directional effect (e.g., 𝐻1: 𝜇 < 3 or 𝐻1: 𝜇 > 3)

when an effect can only occur in one direction

an effect can occur in both directions but only one direction is of interest.

How well did you know this?

Not at all

Perfectly

What is two-tailed tests?

Two-tailed: Tests for any difference (e.g., 𝐻1: 𝜇 ≠ 3)

How well did you know this?

Not at all

Perfectly

What do we compare the test statistic to?

A reference distribution (e.g., t-distribution) to determine if the observed difference is significant.

How well did you know this?

Not at all

Perfectly

How does variability affect hypothesis testing?

Low variability → Easier to detect a true difference

High variability → Harder to conclude significant differences

How well did you know this?

Not at all

Perfectly

What types of statistical tests are covered?

One-sample & two-sample t-tests

ANOVA (more than two groups)
z-tests (proportions)
Chi-square tests (categorical data)
Linear regression (t and F tests)

How well did you know this?

Not at all

Perfectly

How is a t-statistic calculated in a one-sample t-test?

tstat = (data estimate - hypothesised value) / SE(data estimate)

How well did you know this?

Not at all

Perfectly

what will the tstat be if H0 is true

the test statistic (𝑡𝑠𝑡𝑎𝑡
) will be small (dependent on sampling variability) because the difference between the data-estimate (sample mean) and the hypothesised value is small.

How well did you know this?

Not at all

Perfectly

what will the tstat be if H0 is false

the test statistic will be large (dependent on sampling variability) because the difference between the data-estimate and the hypothesised value is large.

How well did you know this?

Not at all

Perfectly

What distribution is typically used as the reference distribution in these examples?

The t-distribution is used as the reference distribution.

How well did you know this?

Not at all

Perfectly

What do the degrees of freedom (df) for the t-distribution depend on?

The degrees of freedom depend on what is being tested.

How well did you know this?

Not at all

Perfectly

What are the two ways the reference distribution helps determine the strength of evidence for the null hypothesis?

By obtaining an exact probability for the test statistic.
By comparing the test statistic to a critical value based on a predetermined significance level.

How well did you know this?

Not at all

Perfectly

In a one-sample two-tailed test, what is the null hypothesis (H0) and alternative hypothesis (H1)?

-𝐻0:𝜇=3.6

𝐻1:𝜇≠3.6

How well did you know this?

Not at all

Perfectly

What is the reference distribution for the test with
𝑛=16?

𝑡𝑑𝑓=𝑛−1=𝑡15

How well did you know this?

Not at all

Perfectly

In a two-tailed test, how is the area in the two tails interpreted?

The area in the tails represents the probability of obtaining a test statistic as extreme or more extreme than the observed value.

How well did you know this?

Not at all

Perfectly

How do you calculate the area in the two tails for the test statistic −0.753?

Study These Flashcards

Add the area in the left tail (< -0.753) and the right tail (> 0.753): 0.226+0.226=0.452

What is a p-value in hypothesis testing?

Study These Flashcards

The p-value is the probability of observing a test statistic as extreme, or more extreme, than the one observed, assuming the null hypothesis is true.

What does the p-value quantify in hypothesis testing?

Study These Flashcards

The p-value quantifies the chance of observing the data (or something more extreme) if the null hypothesis (H0) is true.

When is the null hypothesis typically rejected based on the p-value?

Study These Flashcards

The null hypothesis (𝐻0) is usually rejected when the p-value is very small.

What are some common threshold values (significance levels) for p-values?

0.10 → No evidence against 𝐻0 0.05 → Weak evidence against 𝐻0 0.01 → Some evidence against 𝐻0 0.001 → Strong evidence against 𝐻0

What does a large p-value indicate?

The test statistic is likely under 𝐻0 We fail to reject 𝐻0

What does a small p-value indicate?

The test statistic is unlikely under 𝐻0 We reject 𝐻0 in favor of 𝐻1

What does it mean to "fail to reject the null hypothesis"?

It means that the p-value is large, indicating that the test statistic is likely to occur if the null hypothesis is true. Therefore, there is no strong evidence against 𝐻0

What does it mean to "reject the null hypothesis"?

It means the p-value is small, indicating that the test statistic is very unlikely to occur if the null hypothesis is true. This provides evidence in favor of the alternative hypothesis (𝐻1)

What is the relationship between a large p-value and the test statistic?

A large p-value suggests that the test statistic is likely to occur under the null hypothesis, providing no strong evidence to reject 𝐻0

What is Fixed Level Significance Testing in hypothesis testing?

Fixed Level Significance Testing involves comparing the test statistic to a critical value based on a fixed significance level (e.g., 0.05, 0.1) to decide whether to reject the null hypothesis.

What are statistical tables used for in significance testing?

Statistical tables provide critical values (quantiles) for different reference distributions (like the t-distribution) at various significance levels.

What does the critical value represent in significance testing?

The critical value is the threshold beyond which the test statistic would be considered extreme enough to reject the null hypothesis.

In a two-tailed test with a 5% significance level, how is the significance level distributed in the tails?

2.5% in the lower tail 2.5% in the upper tail

What is a two-sample t-test used for?

A two-sample t-test is used to compare the means of two groups to test if there is a statistically significant difference between them.

What are the null and alternative hypotheses in a two-sample t-test?

𝐻0:𝜇𝐴−𝜇𝐵=0 (no difference between means) 𝐻1:𝜇𝐴−𝜇𝐵≠0 (a difference exists between means)

What formula is used to calculate the test statistic in a two-sample t-test?

tstat = (μ^A-μ^B)−0) / SE(μ^A−μ^B)

What happens if the test statistic is less extreme than the critical value in a fixed-level test?

The null hypothesis is not rejected, indicating that the observed data does not provide strong evidence against the null hypothesis.

Why do we divide the significance level equally between two tails in a two-tailed test?

Because we are testing for a difference in either direction (greater or smaller), so the probability of an extreme result must be shared between both tails.

What is the conservative assumption made about variance in the two-sample t-test?

It is assumed that the variances are unequal, making the test more conservative.

What is a paired t-test used for?

A paired t-test is used to compare two dependent groups where observations in one sample are paired with observations in the other sample (e.g., before and after treatment).

What is the formula for the paired t-test statistic?

tstat = (μ^d−0)/ SE(μ^d)

How is the standard error for the paired t-test calculated?

SE(μ^d)=SDd / sqrt(n) μ^d = Mean of the differences 𝑆𝐷𝑑 = Standard deviation of the differences 𝑛 = Sample size

What are the key assumptions for t-tests?

Independence of data within and between groups Normal distribution of data (assess via histograms or Shapiro-Wilk test) t-tests are robust to non-normal data if sample sizes are similar.

What should you do if normality is unreasonable in a dataset?

Use non-parametric tests, such as the Mann-Whitney-Wilcoxon test.

What does the Mann-Whitney-Wilcoxon test compare?

It compares the ranks of two groups to test if their distributions are the same or different.

How is the Mann-Whitney U test statistic calculated?

U=W− (n(n+1))/2 W = Sum of the ranks 𝑛 = Sample size

What does a large p-value in the Mann-Whitney test indicate?

A large p-value suggests no evidence of a difference between the two groups.

How can you assess the practical significance of a result?

Present the effect size and confidence interval to provide context beyond just statistical significance.

What are the key steps in hypothesis testing?

Specify null and alternative hypotheses Calculate a test statistic Compare to a reference distribution Use data to determine the strength of evidence for the null hypothesis

hypothesis testing Flashcards

(50 cards)