Types of data
Qualitative = categorical
Quantitative = numerical
Types of qualitative data
Nominal - named, no natural order eg. ethnicity, blood groups
Ordinal - ordered/ranked eg. pain score 1-10
Types of quantitative Data
Discrete - whole numbers (counts) eg. number of antenatal visits
Continuous - any value within a range, including decimals eg. birthweight
Descriptive statistics
Measures of central tendency: mean, median, mode
Measures of dispersion: range, interquartile range, standard deviation
Normal distribution and skewness
Mean
Average = sum of all values / number of values
Good for symmetric distributions, but pulled by outliers
Median
Middle value when the data are put in order
If two numbers are in the middle then take the average of those
Mode
Most common number in the dataset
Range
The difference between the highest and lowest values
Formula: range = maximum - minimum
Interquartile range
Def: the range of the middle 50% of the data, between Q1 and Q3
Formula: IQR = Q3 - Q1
SD - what is standard deviation
Measure of how far the data points typically deviate from the mean
Pro: it uses all the data points
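The measures above can be sketched in Python with the standard library. The visit counts and variable names are made up for illustration. Quartile conventions differ between textbooks and software, so the IQR here may not match a hand calculation, and stdev is the sample SD (divides by n - 1).

```python
import statistics

# Made-up data: number of antenatal visits for 8 women
visits = [4, 6, 7, 7, 8, 9, 10, 13]

mean = statistics.mean(visits)      # sum of values / number of values
median = statistics.median(visits)  # two middle values here, so their average
mode = statistics.mode(visits)      # most common value

data_range = max(visits) - min(visits)  # maximum - minimum

# quantiles(n=4) returns the three cut points Q1, Q2 (median), Q3
q1, q2, q3 = statistics.quantiles(visits, n=4)
iqr = q3 - q1                           # IQR = Q3 - Q1

sd = statistics.stdev(visits)       # sample SD, uses every data point

print("mean", mean, "median", median, "mode", mode)
print("range", data_range, "IQR", iqr, "SD", round(sd, 2))
```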
What is a normal distribution
Symmetric bell-shaped curve where mean, median and mode are all the same
Follows the 68, 95, 99.7 rule
68% of data within 1 SD of the mean
95% within 2 SDs
99.7% within 3 SDs
Ex. IQ scores: mean 100, SD 15
68% of people have IQ 85-115
95% have IQ 70-130
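A quick check of the 68, 95, 99.7 rule for the IQ example, as a sketch using Python's statistics.NormalDist. The helper name share_within is made up for illustration.

```python
from statistics import NormalDist

# IQ modelled as a normal distribution with mean 100 and SD 15
iq = NormalDist(mu=100, sigma=15)

def share_within(k):
    """Proportion of the distribution lying within k SDs of the mean."""
    low = iq.mean - k * iq.stdev
    high = iq.mean + k * iq.stdev
    return iq.cdf(high) - iq.cdf(low)

for k in (1, 2, 3):
    low, high = 100 - 15 * k, 100 + 15 * k
    print(f"within {k} SD (IQ {low}-{high}): {share_within(k):.1%}")
```

The exact proportions are slightly off the round numbers (about 68.3%, 95.4% and 99.7%), which is why it is called a rule of thumb.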
What does it mean if you have a large or a small SD
Small SD = data concentrated around the mean, tall narrow bell
Large SD = data spread widely away from the mean, wide flat bell
Right skewed distribution (positive)
Ex. Income: many earn 40 grand but a few earn millions, which pulls the mean higher than the median
Order: mode < median < mean
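Left skewed distribution (negative)
Ex. age of retirement: most retire at about the same age but a few retire much earlier, which pulls the mean lower than the median
Order: mean < median < mode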
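The ordering of mode, median and mean under skew can be seen on small made-up datasets. The incomes and retirement ages below are invented for illustration only.

```python
import statistics

# Made-up incomes in thousands: most are modest, one is huge (right skew)
incomes = [30, 30, 35, 40, 45, 50, 2000]
# Made-up retirement ages: most cluster at 65, a few retire early (left skew)
retire_ages = [50, 58, 62, 64, 65, 65, 65]

def summary(values):
    """Return (mode, median, mean) for a list of numbers."""
    return (statistics.mode(values),
            statistics.median(values),
            statistics.mean(values))

inc_mode, inc_median, inc_mean = summary(incomes)
age_mode, age_median, age_mean = summary(retire_ages)

# Right skew: the long tail drags the mean above the median
print("income  mode/median/mean:", inc_mode, inc_median, round(inc_mean, 1))
# Left skew: the long tail drags the mean below the median
print("retire  mode/median/mean:", age_mode, age_median, round(age_mean, 1))
```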
Parametric vs non parametric
In clinical research, if you are uncertain about the distribution of the data,
which one do you use
Use a non parametric test
What is the non parametric version of the
unpaired t test or
independent t test or
Student's t test
Mann-Whitney U test
Compares 2 independent samples (null hypothesis: both come from the same population)
Ex. compare average time to delivery between Kiwi and forceps
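A minimal sketch of how the Mann-Whitney U statistic is built, using made-up delivery times. The function name u_statistic is illustrative. This only computes U; a p-value would normally come from tables or a statistics package (for example scipy.stats.mannwhitneyu).

```python
# Made-up delivery times in minutes for two independent groups
kiwi = [12, 15, 9, 20, 11]
forceps = [18, 25, 16, 30, 22]

def u_statistic(a, b):
    """Count the pairs where the value from a exceeds the value from b.

    Ties count as half a pair.
    """
    u = 0.0
    for x in a:
        for y in b:
            if x > y:
                u += 1.0
            elif x == y:
                u += 0.5
    return u

u_kiwi = u_statistic(kiwi, forceps)
u_forceps = u_statistic(forceps, kiwi)

# The two U values always add up to len(a) * len(b);
# the smaller one is usually the reported test statistic
u = min(u_kiwi, u_forceps)
print("U kiwi:", u_kiwi, "U forceps:", u_forceps, "U:", u)
```

A U close to 0 means one group is almost always lower than the other; a U near half of len(a) * len(b) means the groups overlap heavily.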
What is the non parametric version of the
One sample t test or
paired t test
Wilcoxon matched pairs signed rank test
Compares 2 sets of observations on a single sample (before and after)
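A sketch of the Wilcoxon signed rank calculation on made-up before and after measurements. The data and the helper average_ranks are illustrative. It computes the rank sums only; the p-value would come from tables or a statistics package (for example scipy.stats.wilcoxon).

```python
# Made-up paired measurements on the same 6 patients
before = [140, 150, 135, 160, 155, 145]
after = [132, 151, 130, 148, 149, 142]

def average_ranks(values):
    """Rank values from 1 upwards; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

# Differences within each pair; zero differences are dropped
diffs = [a - b for a, b in zip(after, before) if a != b]
ranks = average_ranks([abs(d) for d in diffs])

w_plus = sum(r for d, r in zip(diffs, ranks) if d > 0)   # ranks of increases
w_minus = sum(r for d, r in zip(diffs, ranks) if d < 0)  # ranks of decreases
w = min(w_plus, w_minus)
print("W+:", w_plus, "W-:", w_minus, "W:", w)
```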
What is the non parametric version of
ANOVA - one way analysis of variance using total sum of squares
Kruskal-Wallis
Compares 3 or more independent groups (eg. compare decision to delivery time in the end stage of labour for ventouse, forceps and Kiwi)
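A sketch of the Kruskal-Wallis H statistic on made-up decision to delivery times for three groups. The data and helper names are illustrative, and the formula below omits the tie correction (there are no ties in this toy data). A package such as scipy.stats.kruskal would normally be used to get the p-value.

```python
# Made-up decision to delivery times in minutes, three independent groups
groups = {
    "ventouse": [10, 12, 14],
    "forceps": [16, 18, 20],
    "kiwi": [11, 13, 15],
}

def average_ranks(values):
    """Rank values from 1 upwards; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

# Rank all observations together, then sum the ranks within each group
pooled = [v for values in groups.values() for v in values]
ranks = average_ranks(pooled)
n_total = len(pooled)

rank_sums = {}
total = 0.0
start = 0
for name, values in groups.items():
    n = len(values)
    rank_sums[name] = sum(ranks[start:start + n])
    total += rank_sums[name] ** 2 / n
    start += n

# H = 12 / (N (N + 1)) * sum(R_i^2 / n_i) - 3 (N + 1)
h = 12 / (n_total * (n_total + 1)) * total - 3 * (n_total + 1)
print("rank sums:", rank_sums, "H:", round(h, 2))
```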
What is the small sample alternative to the
Chi-square test
Fisher's exact test for small numbers (<10; a common rule of thumb is any expected cell count <5)
Chi-square is for sample size >= 10
Both are used to determine whether there is a significant association between two categorical variables.
What does it do?
The Chi-square test checks whether the observed frequencies in a dataset differ significantly from what we’d expect by chance.
For example: relationship between maternal obesity (obese / non obese) and PET (present / absent)
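A sketch of the chi-square calculation for the obesity and PET example, with made-up counts. Expected counts are what each cell would hold if obesity and PET were unrelated: row total x column total / grand total. No continuity correction is applied here.

```python
# Made-up counts. Rows: obese, non obese. Columns: PET present, PET absent.
observed = [[20, 80],
            [10, 90]]

row_totals = [sum(row) for row in observed]
col_totals = [sum(col) for col in zip(*observed)]
grand_total = sum(row_totals)

chi2 = 0.0
expected = []
for i, row in enumerate(observed):
    expected_row = []
    for j, obs in enumerate(row):
        # Count expected in this cell if the two variables were independent
        e = row_totals[i] * col_totals[j] / grand_total
        expected_row.append(e)
        chi2 += (obs - e) ** 2 / e
    expected.append(expected_row)

# A 2x2 table has 1 degree of freedom; the 5% critical value is 3.84
print("expected:", expected)
print("chi-square:", round(chi2, 2))
```

With these invented counts the statistic is just above 3.84, so p would be just under 0.05.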
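What is the non parametric version of
Pearson correlation coefficient
Spearman's rank correlation
Pearson assesses the strength of the straight line association between 2 continuous variables; Spearman does the same on the ranks, so it picks up any consistently increasing or decreasing (monotonic) association
Ex. whether HbA1c is related to birth weight in a diabetic mom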
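A sketch comparing Pearson and Spearman on made-up HbA1c and birth weight values. Spearman is computed here as Pearson on the ranks. The helper simple_ranks assumes there are no tied values, which holds for this toy data only.

```python
import math

# Made-up data: HbA1c (%) and birth weight (kg) for 5 diabetic mothers
hba1c = [6.0, 6.5, 7.0, 7.5, 8.0]
birth_weight = [3.0, 3.2, 3.3, 3.9, 4.6]

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def simple_ranks(values):
    """Rank from 1 upwards. Assumes no tied values."""
    ordered = sorted(values)
    return [ordered.index(v) + 1 for v in values]

r = pearson(hba1c, birth_weight)
rho = pearson(simple_ranks(hba1c), simple_ranks(birth_weight))

# Birth weight always rises with HbA1c here, but not in a straight line,
# so Spearman is exactly 1 while Pearson is a little below 1
print("Pearson r:", round(r, 3), "Spearman rho:", round(rho, 3))
```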
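What is multiple logistic regression
Calculates the relationship between an outcome and several independent or predictor variables like age, smoking, parity
Logistic regression needs a binary outcome eg. low birth weight (yes / no); for a continuous outcome such as birth weight in grams, use multiple linear regression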
What do inferential statistics include
Hypothesis testing (null and alternative hypotheses)
P-value and significance - typically p<0.05
Confidence intervals (CI) and their interpretation
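A sketch of a 95% confidence interval for a mean, using made-up birth weights. It uses the normal multiplier (about 1.96) as a simplification; with a sample this small a t distribution multiplier would be a little larger and the interval a little wider.

```python
import math
import statistics
from statistics import NormalDist

# Made-up sample of birth weights in grams
birthweights = [3100, 3400, 3250, 3600, 3500, 3300, 3450, 3200, 3550, 3350]

n = len(birthweights)
mean = statistics.mean(birthweights)
se = statistics.stdev(birthweights) / math.sqrt(n)  # standard error of the mean

z = NormalDist().inv_cdf(0.975)  # about 1.96 for a 95% interval
lower = mean - z * se
upper = mean + z * se

print(f"mean {mean} g, 95% CI {lower:.0f} to {upper:.0f} g")
```

Interpretation: if the study were repeated many times, about 95% of intervals built this way would contain the true population mean. A narrower interval means a more precise estimate.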