Q1-R, 2/13 Flashcards
(16 cards)
What is the primary reason for using a t-test instead of a z-test?
For n<30 we can’t use the z-distribution and must use t-test instead
The t-test is preferred for smaller sample sizes due to the increased variability in estimating the population standard deviation.
What does the T-distribution represent?
A family of curves based on degrees of freedom (n-1)
The T-distribution is similar to the normal distribution but has heavier tails, which account for the additional uncertainty in smaller samples.
What is the relationship between the t-distribution and the normal distribution as sample size increases?
Approaches normal as n increases
As the sample size increases, the t-distribution becomes more similar to the normal distribution.
What are the assumptions when using z or t distributions?
- Data come from a random sample of the population
- The sample average is approximately normal
- Either the data are normal or the CLT applies
The Central Limit Theorem (CLT) states that the distribution of the sample mean approaches a normal distribution as the sample size becomes large.
What do p-values indicate?
The probability that our data would have occurred under the null hypothesis
A small p-value suggests that the observed data is unlikely under the null hypothesis, leading to its potential rejection.
What does a small p-value imply?
The occurrence is unlikely under the null hypothesis
It does not imply that the null hypothesis is false, just that the data observed is rare under that hypothesis.
True or False: The p-value measures the probability that the studied hypothesis is true.
False
The p-value does not provide any information about the truth of the hypothesis itself.
True or False: The p-value measures the size of the effect.
False
The p-value does not provide any information about the size of the observed effect.
True or False: The p-value tells us the importance of a result.
False
The p-value does not provide any information about the importance of the result.
What is the significance level in hypothesis testing?
Our preselected criterion to accept or reject the null hypothesis
Common significance levels include 0.05, 0.01, and 0.10.
What differentiates independent samples from dependent samples?
- Independent samples: values in one sample reveal no information about the other
- Dependent samples: values in one sample affect the values in the other
Dependent samples are often paired or matched in some way, such as before-and-after measurements.
When to use a z-test in a two-sample test?
- If σ is known, use z-test and use σ instead of s
- If σ is not known but n>30, use z-test
The z-test is appropriate for larger samples where the population standard deviation is known or can be sufficiently estimated.
What is the formula for the t-test with independent samples (assumes unequal variances)?
See image
This formula is used when the variances of the two groups being compared are not assumed to be equal.
What is the test statistic for dependent samples in a two-sample test?
D=X_1-X_2
Here, D represents the difference between paired observations.
How is the t statistic calculated for dependent samples?
See image
What should be reported in papers about t-tests?
- Means of each group
- Some measure of variability (typically standard deviation)
- Mean difference between groups (and variability in differences)
- Info on the test: type of test performed, T-value, degrees of freedom, p-value, possibly confidence intervals
Reporting these elements provides a clear understanding of the analysis and its results.