Teaching block 3 - Statistics - (weeks 7,8&9) Flashcards
(53 cards)
what are the 2 largest categories for data
categorical and quantative
what are the 3 types of categorical data
binomial - presence / absence
nominal - non ordered groups (yellow)
ordinal - ordered groups eg low med and high
what are the 2 types of quantitative data
continuous and descrete
continuous data can also be interval or ratio data.
what are the 3 main data types in EEA
continuous, descrete and nominal
what practices should be avoided when carrying out statistical tests
cherry picking (selecting best data)
p-hacking (continuous analysis)
harking (hypothesising after result)
fishing experiments (testing between random variables to find an effect)
what are good statistical practices
consider analysis and stats test before data collection
make sure data is quality as well as quantity
document all data prep ands analysis
make sure data meets assumptions of the stats test
what is hypothesis testing
it tells us how different is different enough when looking at two supposedly differing groups.
it gives a chance to reject the null hypothesis
probability distribution facts
area under curve = 1
they describe the probability of obtaining a certain value with in a range
what is a p value
probability of either outcomes assuming the null
choose a significance level (0.05). this is the probability of a false positive (type I error)
when should a chai squared test be used X^2
when both the explanatory and response variables are categorical
give an example of when a X^” test should be used
do humming birds have a preference for flower color.
when setting this experiment up make sure the number of red and yellow flowers is in a ration 1:1
how should a X^” test be carried out
- calculate the expected frequencies under null hypothesis
- quantify difference between expected and observed frequency’s using the X^” equation.
- calculate degreed of freedom
- find p-value with degrees of freedom
- check to see if X^2 value is past the critical value on the probability distribution
what is the chai squared equation
sum of - (observed - expected)^2/expected
what is a degree of freedom?
the number of data points that are free to change independently without changing the statistical parameters of the sample
why are degrees of freedom important
the probability distribution for the p value changes significantly depending on the degrees of freedom in the experiment
what are the limitations of the chai squared test
- there must be more than 5 numbers in each class, if not they should be combined
chai squared checklist
- always calculate observed and expected VALUES not proportions
- make sure null is biologically sound
- use null as basis to calculate the expected values
- always quote degrees of freedom
when should a linear model be used
when the response variable is continuous
when should an ANOVA model be used
continuous response variable and categorical explanatory variables
what does an ANOVA model analyse
analyses the difference (ratio) between among group variation and with in group variation.
what would a significant result look like from an ANOVA test
if the among group variability is greater than the within group variability. this means there is significant differences between groups but not a lot of variation with in the groups.
what is mew
the global mean (across all groups)
what does it mean if each group is very close to mew
there is no relationship between the groups
what is a residual regarding ANOVA
the difference of each data point from the with in group mean