Exam revision questions Flashcards
(106 cards)
What are the assumptions for t-tests, chi-squared tests, ANOVA and regression?
What is the effect size measure for a chi-squared test of independence?
Cramer’s V.
what is the difference between a one-sample and two-samples (independent and paired) t-test?
A one-sample t-test is comparing a sample mean to a value, such as a known population mean or a specific value.
A two-samples t-test is when we are comparing to sample means.
If the assumption of normality of variables is violated, what can you do if you wanted to do a one-samples t-test originally?
You can do a Wilcoxon test. Wilcoxon tests compare data by ranks and not actual values.
What does a p-value represent?
Can we say the null hypothesis is true or false?
A p-value tells us the probability, if the null hypothesis is true, of observing a test stat at least as extreme as ours.
We cannot claim whether the null hypothesis or the alternative hypothesis are true based on p-values. We can only draw inferences about how likely it is that either are true.
What is the standard deviation of the sampling distribution of the mean?
The standard error of the mean, or SEM = sample sd/square root of the sample size.
What factors influence power?
What is power?
Power is when we reject the null hypothesis and the null hypothesis is actually false. It measured by 1-beta, where beta is the type II error rate.
Power is dependent on sample size, alpha and effect size.
Can you say that p is the probability that the null hypothesis is true?
No!
The p-value is how likely you are to see your data IF the null was true.
What are the assumptions of a chi-squared test?
- Large frequencies - at least 5 in each cell.
If violated use Fischer Exact Test. - Independence of data - no one contributed for than one piece of data.
If violated used McNemar test.
What is the common effect size used when doing chi-squared tests?
Cramer’s V.
The higher Cramer’s V, the more likely the variables are associated, not independent.
Think chi-square test for association or independence.
By definition do all z-scores have a mean of 0 and standard deviation of 1?
Yes.
I don’t quite understand what that means.
How do we come up with the t-distribution? Even if we know the population mean we do not know the standard deviation…
The t-distribution is created by taking the population mean then averaging over lots of different possible population sds. As N increases, we become more accurate at predicting population sd and the t-distribution becomes more normal and tighter.
As the t-distribution is dependent on the degrees of freedom, is it true that whether a t-statistic is significant depends on the sample size?
Yes.
A given value for a t-stat may be significant for a sample of 20, but not for a sammple of 10.
Do the degrees of freedom used in a Welch t-test of independence take into account just how different the variance is within each groups/sample?
Yes.
What type of t-test assumes homogeneity of variance?
What type of t-test takes into account the different standard deviations of the samples?
Student independent samples t-test.
Welch independent samples t-test.
What are the assumptions made for an independent samples t-test?
- Samples distributions are normal.
- Data are independent.
- Variance of samples are the same if using a student independent samples t-test. If violated, which it normally is, then use Welch independent samples t-test).
When do we use a paired-samples t-test?
When we are interested in the difference scores, not just whether two means ARE different.
Examples would be in repeated measures designs, such as pre- and post-treatment. Also, if the there is a common object in each group, such as two people giving ratings for the same set of hats and we wanted to know whether their mean rating of the hats overall differed.
One of the assumptions of all t-tests is that the data in each group are normally distributed. How do we test this both qualitatively and quantitatively?
QQ plots can be used to qualitatively check this.
Shapiro-Wilk tests can be used to quantitatively assess this.
What does a Shapiro-Wilk test with a W less than 1 and a p < .05 suggest.
That the data are not normally distributed.
What is the null hypothesis for a Shapiro-Wilk test?
That the data are distributed normally.
Is it true that Shapiro-Wilk tests will often be significant, i.e. imply non-normality, even if the data are normally distributed?
What can we do check normality if we have a large sample size and Shapiro-wilk is coming significant?
Yes.
If sample size is over 40-50 then Shapiro-Wilk test likely to be significant.
We can look at QQ plots and histograms to assess whether there is a normal distribution of data.
For an independet samples t-test you need to check the normality of each group. When checking the assumption of normalcy for paired-samples t-test, what group/s are we testing?
The difference variable. This is because the test we are doing is essentially a one samples t test on the difference variable.
What are non-parametric tests?
Statistical tests that do not make assumptions about the distribution of data.
What are some limitations with non-parametric tests?
They are not as powerful, i.e. they have higher type two error rates.