Quantitive analysis Flashcards
why is statistics important?
- way to describe and measure biology
- allows us to estimate an unknown quantity using our sample
- test hypotheses
is stats math?
kinda? helps us to define what we know, how we know it and how strong the evidence is
what do we use descriptive stats for
summarize and describe the data
describe patterns
tables, graphs and charts
- the first step of making inferences
what are the 4 types of descriptions of data and their two categories
Categorical
1. nominal
2. ordinal
continuous
1. interval
2. ratio
What is nominal data?
- at type of categorical data
1. non - ordered categories - sex
- blood type etc
what is ordinal data ?
type of categorical data
- ordered categories
eg: house hold income - lowest to highest
eg: age
how do we display nominal data?
- frequency tables
- bar graphs
how do we display/describe continuous data?
- frequency table
- histogram
- parameters - mean, median etc
what is a frequency table?
Create artificial groups and report the frequency
* e.g. age
* This is not very informative and throws out information
what does a histogram show?
the shape of the data,
how to describe continuous variables?
- when measuring central tendency
a)mean
b) media
c)mode - measuring of variability or spread
a)range
b)percentiles
c) variance/standard deviation
what is a central tendency?
the average distribution.continuous variable
summarizes the data with a single value
what can the mean be distorted by?
outliers
what is the median?
middle of data
order from smallest to largest and average two middle points
what is the mode?
number that occurs most often in the data
can be more than 1
not impacted by outliers
when to use mode, median and mean
mode - rare
mean - if your sample is normal - bell
median - if your sample is skewed
what is the range?
smallest to largest values
what are the percentiles? -hard
- a measure of the dispersion for one variable that indicates the % of cases at or below a score or point
eg: the median is the 50th percentile
eg: 25th percentile is the value at which 25% of the distribution have that value or a lower one
often report quartiles - 25th, 50th, 75th
what is standard deviation/variance?
- the measure of dispersion for one variable that indicates an “average distance” between the values and the mean
what is generally used to show continuous data visually?
boxplots
What do boxplots show? - parts of it
- the median
- the IQR - intervals q1 q2 etc
- the range or the outliers
if the median is the centre of a boxplot what does it mean
that you have a bell-shaped curve
what can boxplot diagrams be used for?
- to compare different groups
what are the types of prevalence?
- point prev
- period prev
- lifetime prev