statistics and data analysis Flashcards
(39 cards)
what is a nominal variable
data in the form of labels or names
but without a natural order or ranking
e.g. eye colour
it is a type of categorical variable
what is an ordinal variable
give an example
a named variable
which has a natural order e.g. ‘level of satisfaction’ or ‘degree of pain’ or ‘socioeconomic status’ or ‘workplace status’
what is an interval variable
give an example
named variable
with a natural order
AND equal intervals between values
e.g. temperature, credit scores, SAT scores
but NOT a ‘true zero’ value (that’s a ‘ratio’ variable)
for temperature, you can have negative degrees Celsius, so 0 is not a floor that the value cannot go beneath
a credit score cannot be negative or zero, so its scale has no true zero point either
what is a ratio variable
give an example
named variable
with a natural order
AND equal intervals between values
AND a ‘true zero’ value
e.g. height, weight, length
e.g. your height could technically be zero but NOT negative
the same goes for weight
how to calculate standard error of the mean
SD / square root of sample size
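A minimal Python sketch of the formula above, assuming the SD is the sample standard deviation (n - 1 denominator). The function name and the example values are made up for illustration.

```python
import math
import statistics

def standard_error_of_mean(sample):
    """SEM = sample SD / square root of sample size."""
    n = len(sample)
    if n < 2:
        raise ValueError("need at least two observations")
    sd = statistics.stdev(sample)  # sample SD, n - 1 denominator
    return sd / math.sqrt(n)

heights_cm = [160, 165, 170, 175, 180]  # made-up data
sem = standard_error_of_mean(heights_cm)
print(round(sem, 2))
```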
how to calculate BMI
kg / (m^2)
weight in kg / height in m / height in m
obesity class 3 is
40+
obesity class 2 is
35 - 39.9
obesity class 1 is
30 - 34.9
overweight is BMI
25 - 29.9
Normal BMI is
18.5 - 24.9
underweight BMI is
<18.5
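The BMI formula and the categories on these cards can be sketched in Python as below. The function names are hypothetical, and treating each upper bound (24.9, 29.9, ...) as "less than the next cut-off" is an assumption made so that values such as 24.95 still land in a category.

```python
def bmi(weight_kg, height_m):
    """BMI = weight in kg / (height in m) squared."""
    return weight_kg / height_m ** 2

def bmi_category(value):
    """Map a BMI value to the category names used on the cards."""
    if value < 18.5:
        return "underweight"
    if value < 25:
        return "normal"
    if value < 30:
        return "overweight"
    if value < 35:
        return "obesity class 1"
    if value < 40:
        return "obesity class 2"
    return "obesity class 3"

example = bmi(70, 1.75)  # made-up person: 70 kg, 1.75 m
print(round(example, 1), bmi_category(example))
```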
how to calculate odds (and an odds ratio)
odds: divide the probability that the event will occur by the probability that it will not occur
In other words, it’s a ratio of successes (or wins) to losses (or failures)
Odds = p / (1 - p)
an odds ratio compares two groups: Odds ratio = odds in group 1 / odds in group 2
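The formula p / (1 - p) gives the odds for a single group; comparing two groups means dividing one group's odds by the other's. A small Python sketch, with hypothetical function names and made-up probabilities:

```python
def odds(p):
    """Odds of an event with probability p: p / (1 - p)."""
    if not 0 <= p < 1:
        raise ValueError("p must be in [0, 1)")
    return p / (1 - p)

def odds_ratio(p_group1, p_group2):
    """Ratio of the odds in group 1 to the odds in group 2."""
    return odds(p_group1) / odds(p_group2)

# made-up example: event probability 0.5 in the exposed group, 0.2 in the unexposed group
print(odds(0.2))             # about 0.25
print(odds_ratio(0.5, 0.2))  # about 4
```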
how to calculate standard deviation
square root of the variance
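As a sketch of that step in Python, the variance is computed by hand and then square-rooted. Using the sample variance (n - 1 denominator) is an assumption; a population SD would divide by n instead.

```python
import math

def sample_variance(sample):
    """Average squared deviation from the mean, using n - 1."""
    n = len(sample)
    if n < 2:
        raise ValueError("need at least two observations")
    mean = sum(sample) / n
    return sum((x - mean) ** 2 for x in sample) / (n - 1)

def standard_deviation(sample):
    """SD = square root of the variance."""
    return math.sqrt(sample_variance(sample))

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data
print(round(standard_deviation(scores), 3))
```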
what does a low p-value mean
there is strong evidence to reject the null hypothesis
i.e. results at least this extreme would be unlikely if only chance were at work (it is NOT the probability that the null hypothesis is true)
so a low p-value is usually exciting for your experiment :)
what is a t-test
assesses whether the means of two groups are statistically different from each other
if you are comparing 2 different diets (for example), which test would you use?
if you are comparing 3 diets, which test would you use?
2 - two sample t-test
3 - ANOVA
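For the two-diet case, the test statistic can be sketched by hand in Python. This assumes the classic pooled-variance (equal variances) two-sample t-test; the function name and data are made up. Turning t into a p-value needs the t distribution, which libraries such as SciPy provide (`scipy.stats.ttest_ind`, and `scipy.stats.f_oneway` for a one-way ANOVA across three or more groups).

```python
import math
import statistics

def two_sample_t(group_a, group_b):
    """Return (t statistic, degrees of freedom) for a pooled two-sample t-test."""
    n1, n2 = len(group_a), len(group_b)
    m1, m2 = statistics.mean(group_a), statistics.mean(group_b)
    v1, v2 = statistics.variance(group_a), statistics.variance(group_b)
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * v1 + (n2 - 1) * v2) / df
    se_diff = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (m1 - m2) / se_diff, df

# made-up weight loss (kg) on two diets
diet_a = [2.0, 3.0, 4.0]
diet_b = [5.0, 6.0, 7.0]
t_stat, df = two_sample_t(diet_a, diet_b)
print(round(t_stat, 3), df)
```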
What does a p value of 0.05 mean?
The p value is a probability: if your p value is 0.05, that means that, if the null hypothesis were true, you would see a test statistic at least as extreme as the one you found 5% of the time.
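To make "at least as extreme" concrete, here is a small Python sketch of an exact one-sided p-value for a coin-flip experiment. The scenario (null hypothesis: a fair coin) and the function name are assumptions chosen for illustration.

```python
from math import comb

def upper_tail_p_value(successes, trials, p_null=0.5):
    """P(X >= successes) when X ~ Binomial(trials, p_null), i.e. under the null."""
    return sum(
        comb(trials, k) * p_null ** k * (1 - p_null) ** (trials - k)
        for k in range(successes, trials + 1)
    )

# made-up result: 9 heads in 10 flips
p_value = upper_tail_p_value(9, 10)
print(p_value < 0.05)
```

Here the p-value adds up the probability of 9 heads and of 10 heads, the outcomes at least as extreme as the one observed.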
what is a Forest plot?
the way you represent data in a meta-analysis
Forest plot’s vertical line is…
what value is the line at?
the line of no effect
if a RR or OR, the line is at 1.0
for a difference measure (e.g. absolute risk difference or mean difference), the line is at 0
i.e. no difference between 2 interventions
or no association between an exposure and an outcome
Forest plot horizontal line at each point is…
the confidence interval for that study in the meta-analysis
(a Wider confidence interval is Worse.
Or a Longer line is Less reliable data)
on a Forest plot, a bigger box means…
a larger weight in the meta-analysis, usually because of a larger sample size
as these studies carry more weight, their confidence intervals tend to be narrower too.
the diamond-shaped box at the bottom of the Forest plot is…
the lateral tips of the diamond represent…
the weighted average of all the studies
the lateral tips of the diamond are the confidence interval of the weighted average of all the studies
If the diamond at the bottom of a Forest plot touches the vertical line, that means…
the pooled result of the meta-analysis is not statistically significant
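The diamond can be sketched in Python as a weighted average with a confidence interval. The cards do not say how studies are weighted; this sketch assumes fixed-effect inverse-variance weighting on the log odds ratio scale, which is one common scheme. The function names and study values are made up.

```python
import math

def pooled_estimate(effects, standard_errors, z=1.96):
    """Inverse-variance weighted average with an approximate 95% CI."""
    weights = [1 / se ** 2 for se in standard_errors]  # precise studies weigh more
    total = sum(weights)
    pooled = sum(w * y for w, y in zip(weights, effects)) / total
    pooled_se = math.sqrt(1 / total)
    return pooled, pooled - z * pooled_se, pooled + z * pooled_se

def touches_no_effect_line(lower, upper, no_effect=0.0):
    """True if the interval includes the line of no effect (0 on the log scale)."""
    return lower <= no_effect <= upper

# made-up odds ratios and standard errors of their logs for three studies
odds_ratios = [0.8, 0.6, 0.9]
log_se = [0.2, 0.3, 0.25]
centre, low, high = pooled_estimate([math.log(x) for x in odds_ratios], log_se)

# back-transform to the odds ratio scale, where the line of no effect is 1.0
print(round(math.exp(centre), 2), round(math.exp(low), 2), round(math.exp(high), 2))
print("touches line of no effect:", touches_no_effect_line(low, high))
```

The two outer values printed on the first line correspond to the lateral tips of the diamond.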