Slides 12 ⋆. π™šΛš Flashcards

1
Q

What is the Null Hypothesis?

A

A statement that there is no effect or no difference, serving as a default position in hypothesis testing

The Null Hypothesis is often denoted as H0.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is the Alternative Hypothesis?

A

A statement that contradicts the Null Hypothesis, suggesting that there is an effect or a difference

The Alternative Hypothesis is often denoted as H1 or Ha.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What is the difference between a 1-Sided and 2-Sided test?

A

A 1-Sided test checks for an effect in one direction, while a 2-Sided test checks for an effect in both directions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a Significance Level (Ξ±)?

A

The probability of rejecting the Null Hypothesis when it is actually true, commonly set at 0.05.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What does a p-value represent?

A

The probability of obtaining test results at least as extreme as the observed results, given that the Null Hypothesis is true.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Is a good p-value sufficient for practical significance?

A

No, a β€˜good’ p-value does not imply that the results are practically useful or interesting.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

How is the significance level (Ξ±) related to Type 1 and Type 2 errors?

A

A smaller Ξ± reduces the probability of a Type 1 error (false positive), while a larger Ξ± reduces the probability of a Type 2 error (false negative).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

When should you choose a smaller significance level?

A

When a false positive is especially dangerous or costly.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the Central Limit Theorem (CLT) condition?

A

The condition that, with a large enough sample size, the sampling distribution of the sample mean will be approximately normally distributed.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What is the t-distribution?

A

A probability distribution used when the sample size is small and population standard deviation is unknown, characterized by thicker tails than the normal distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the key characteristics of the t-distribution?

A
  • Bell-shaped
  • Centered at zero
  • Has thicker tails than the normal distribution
  • Defined by degrees of freedom (df = n - 1)
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When is the t-distribution nearly indistinguishable from the normal distribution?

A

When the degrees of freedom is about 30 or more.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is a t-test?

A

A statistical test used to compare the means of two groups when the sample size is small and the population variance is unknown.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the significance level (Ξ±) for a typical hypothesis test?

A

Commonly set at 0.05.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What is the average time for all runners who finished the Cherry Blossom Race in 2006?

A

93.29 minutes.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What should you check for when determining normality in a sample?

A

Look for clear outliers and ensure adequate sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

What is a key consideration when using the t-distribution with small samples?

A

The estimates become less precise, leading to potential issues when using the normal distribution for modeling.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

What is the formula for degrees of freedom in a t-distribution?

A

df = n - 1.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

What is the typical sample size needed to rely on the normal distribution?

A

Typically, a sample size greater than 30.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

What is the sample size for the Cherry Blossom Race study in 2017?

A

100 randomly sampled participants.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

What should you do if extreme outliers are present in your sample?

A

Reassess the normality condition as it may not be satisfied.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

In hypothesis testing, what does a two-sided test account for?

A

The probability of obtaining a more extreme value on both sides of the distribution.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

Fill in the blank: The p-value is different than the __________.

A

[size of the effect].

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

What is the significance level Ξ± used in the hypothesis test?

A

0.05

This significance level is commonly used in hypothesis testing.

25
What is the final p-value we will compare to our significance level?
0.0197 ## Footnote This p-value indicates the strength of the evidence against the null hypothesis.
26
What is the conclusion if the p-value is smaller than alpha?
Reject the null hypothesis ## Footnote This indicates that there is statistically significant evidence to suggest a difference.
27
What evidence does the data provide regarding the average run time for the Cherry Blossom Run in 2017 compared to 2006?
The average run time is different ## Footnote The observed value indicates that runners in 2017 were slower on average.
28
What were the two hypotheses (H0 and HA) regarding traffic flow on Friday the 13th?
* H0: Average traffic flow on Friday 6th and 13th are equal. * HA: Average traffic flow on Friday 6th and 13th are different. ## Footnote These hypotheses were tested to investigate the impact of superstition on driving behavior.
29
What is the sample size (n) of the traffic flow data collected?
10 ## Footnote A small sample size can affect the reliability of the statistical tests.
30
What statistical distribution is used to model the sample mean when the sample size is small?
t-distribution ## Footnote The t-distribution is used instead of the normal distribution for small sample sizes.
31
What does the Central Limit Theorem (CLT) state about sampling distributions?
They will be nearly normal if the population distribution is nearly normal. ## Footnote This is true regardless of sample size, but small samples are harder to verify for normality.
32
What is the critical value used in constructing confidence intervals when using the t-distribution?
t* ## Footnote The critical value changes based on degrees of freedom and sample size.
33
How is the margin of error (ME) calculated for confidence intervals?
ME = critical value * SE ## Footnote SE stands for standard error.
34
What does a p-value of 0.0008 indicate in the context of the hypothesis test?
Strong evidence against the null hypothesis ## Footnote This low p-value suggests a significant difference in traffic flow.
35
What is being investigated regarding people's behavior on Friday the 13th?
Whether people drive less compared to the previous week ## Footnote This is linked to superstitious behavior.
36
What is the function used in R to calculate the p-value for the t-test?
pt() ## Footnote This function is specifically for the t-distribution.
37
In what scenario should researchers be cautious when verifying the normality condition?
When working with small sample sizes ## Footnote Small samples can lead to unreliable assessments of normality.
38
What is a confidence interval (CI) used for in hypothesis testing?
To estimate the true population difference ## Footnote CIs provide a range of values that likely contain the population parameter.
39
How is the critical t-value determined for constructing a 95% confidence interval?
Using the qt() function with 0.975 as input ## Footnote This splits the 5% uncertainty for a 95% CI.
40
What is the input value used for qt() when calculating a 95% confidence interval?
0.975
41
In constructing a confidence interval for a small sample mean, what does the 5% uncertainty imply?
Half into the upper tail and half into the lower tail
42
What is the average adult weight of an African elephant?
5 tons
43
What was the sample mean weight of the 120 forest elephants in the study?
3.5 tons
44
What was the standard deviation of the sample weight of the forest elephants?
1 ton
45
Is there strong evidence that forest elephants are a different size based on the sample data?
This question requires statistical analysis to answer
46
If Curry attempted 200 shots from 2-3 feet, what would we analyze?
How confident we could be that his shooting is better than the average NBA player
47
What is the formula for standard error when calculating pHat?
sqrt(p*(1-p)/n)
48
What is the value of Zstar1 used in confidence interval calculations?
1.96
49
What is the value of Zstar2 used in confidence interval calculations?
2.576
50
What is the formula for calculating the standard error of a mean (SE) given standard deviation (s) and sample size (n)?
s/sqrt(n)
51
Fill in the blank: The formula for Tdiff is (1836 - 0) / ______.
SEdiff
52
What is the sample size (n) used in the analysis of the forest elephants?
120
53
What is the mean (mu) value used in the analysis?
5
54
What is the sample mean (Xbar) of the forest elephants?
3.4
55
What does SE represent in statistical analysis?
Standard Error
56
True or False: The sample data suggests that forest elephants are larger than the average adult African elephant.
This requires further statistical evidence
57
What is the formula used to calculate z in hypothesis testing?
(pHat - p) / SE
58
What is the standard error (SE) for a sample proportion with p = 0.15 and n = 25?
sqrt(0.15*0.85/25)
59
Which statistical software is mentioned for demonstrating the calculation of t*?
R