Me Myself and I Flashcards
(43 cards)
Why is biology considered a quantitative
Biology involves collecting and analyzing quantitative data to test hypotheses and make prediction. Biological research relies on quantitative measurements of different parameters.
Why is understanding data so important?
There is a lot of variation within biology, by understanding data, we can establish causes and make predictions. We can establish cause-effect relationships by manipulating variables and understanding the resultant data.
Why aren’t excel files used to collect data?
Excel files cant be easily opened and used in other software, CVS files are easier to work with.
How can graphs and statistics help you understand data?
Graphs allow you to visualize data, statistics allow you to summarize data.
What is the difference between samples and populations?
A sample is usually only a subset of the total population. A population is all the members of a defined group.
How can we make sure our sample is representative of the population?
Sampling bias is the result of poor experimental design and biases the results. In order to avoid this, the sample should be as representative of the entire population as possible.
What is sampling error?
The random variation introduced into a data set as a result of only sampling a subset of the population. The results from your sample may not be applicable to the entire population.
What type of data collection may create bias?
Self-reported data can create bias due to inaccurate reports.
Why is statistical testing important?
Its not enough to look at the data and make assumptions, it is important to statistically test the reliability and significance of our findings.
What is a statistical hypothesis?
A statistical hypothesis is a statement or assumption about the characteristics of a population or the relationship between variables that is subject to statistical testing
What is the null hypothesis?
The default expectation that categorical outcomes are equally likely, and there is no relation between two measured phenomena or that there is no association between groups.
What is an alternative hypothesis?
The expectation that categorical outcomes are not equally likely, that there is a relation between two measured phenomena or an association between groups
When should you used a chi-squared test?
When comparing two categorical variables e.g. diabetes prevalence amongst males and females
When should you use a t-test?
When comparing one categorical variable with one continuous variable- compares the means of both groups e.g. blood pressure amongst males and females
When should you use a general linear model?
When trying to establish a relationship between two continuous variables e.g. height and finger length.
What is statistical significance?
Statistical significance is the claim that the produced results would be very unlikely under the null hypotheses, so there is a relationship or association between variables.
What is the p-value?
P-value is a measure of statistical significance, it is the probability of the shown results occurring under the null hypothesis
What is the p-value threshold?
The p-value threshold is 0.05. If a p-value is lower than this, results are generally considered statistically significant.
Why does a very low p-value indicate more significance?
The lower the p-value, the less likely the results would occur under the null hypothesis. A p-value of 0.05 suggests there is a 5% chance the data would show randomly under the null hypothesis.
Why may a p-value very close to the threshold not be reliable?
A p-value very close the the threshold may be due to a sampling error. If this is the case, bigger more representative samples may help reduce sampling bias
What is a type one error?
False positive, this is when results provide evidence against the null when it is true.
What is a type two error?
False negative, this is when results provide evidence for the null when it is not true.
What is effect size?
Effect size is the magnitude of the effect seen in the results. The p-value may be very low but the effect may be negligible e.g. drug trial shows very low change in variable is not useful. Effect size with continuous variables measures the strength of the association and the gradient of the line.
Why is biological context important for interpreting data?
Biological context helps interpret data correctly, can give a explanation to why a result has occurred e.g. spikes in names such as harper when Beckhams daughter was born, or drops in names such as alexa.