Lecture 8 - ANOVA Flashcards
What is does ANOVA test?
The difference between two or more population means
What kind of test is the ANOVA? parametric or non-parametric
Parametric test
What are the 4 assumptions of an ANOVA?
1) Random sampling
2) Homoscedasticity (=equal variances)
3) Independent measurements or observations
4) Normal distribution
What is a variable?
a variable is what is measured by experimentalist = response or dependent variable
What is a Factor?
The effect under investigation = independent variable
ie. salinity, temperature, etc.
What are Factor Levels?
different treatment levels in an experiment it is something that the experimenter varies
ie. PCB or temp at various levels
What are the 2 types (and sub-types) of ANOVA’s?
1) Univariate - one variable (response) measured
sub-types: one way (one factor) or multi-way (two or three factors)
2) Multivariate - more than one variable measured
What are two main sources of variation?
1) between sample or population means = factor
2) Within samples or populations = error
What is the variance in ANOVA?
is the difference between the population means high or low
What kind of output do we want with sources of variation in regards to ANOVA?
We want to see a high factor variance between factors and a low error within the samples or populations
When are samples unlikely to come from the same population?
If the variation between the sample means is large relative to the variation within the samples
What does accuracy mean?
Accuracy means we know the true value. However this is not often the case
If the means are almost the same, what happens to the residuals?
The residual becomes zero
What is the ability to detect change in the response?
sensitivity and is related to the number of levels
high sensitivity is detecting change
What happens when sample (level) means are close together?
high internal variability
What is precision related to?
related to the error, and repeatability of the experiment.
What happens when precision is close?
high repeatability
What happens when precision is far?
low repeatability
What happens with the null hypothesis when there is high within variance and low between?
Get closer to accepting the null that the populations come from the same population (there is no real difference between the populations
What happens to the null hypothesis when there is high within variance and higher between variance?
End up mid-way between accepting and rejecting (not significantly accepting or rejecting)
What happens to the null hypothesis when there is low within variance but higher between variance?
Reject the null and the two means/populations are in fact different from each other. No overlap with the between variance means significantly different
Why was the Crakenback river Univariate ANOVA criticized?
The sites are not independent of each other because they flow into each other and the same statistics that were designed for manipulation were used for a monitoring study
What was the result of the Crackenback river sites (factors) not being independent?
Huge bias and pseudoreplication
What is pseudoreplication?
Treating data that is dependent as independent