Point above which have of the observations fall - not affected by outliers - more accurate than the "average" in skewed distributions

S - square root of the sample variance - estimates average variation of n-values from the mean - tells us how much variability is expected among individuals

Numbers and % of subjects in each category - median - smallest and largest - values/range

Graph and visualize the distribution - mean/median/mode - smallest and largest - values/range - percentiles - variances - standard deviations

Statistics/Stat Approaches Flashcards by Nicolaas Bloemen

Mean

Addition of all samples divided by number of samples

How well did you know this?

Not at all

Perfectly

Median

Point above which have of the observations fall

not affected by outliers
more accurate than the “average” in skewed distributions

How well did you know this?

Not at all

Perfectly

Mode

Most common observed variable

How well did you know this?

Not at all

Perfectly

Right Skewed Distribution

Tail is on the right hand side

How well did you know this?

Not at all

Perfectly

Left Skewed Distribution

Tail is on the left hand side

How well did you know this?

Not at all

Perfectly

Measures of spread (2)

1) Range
2) Percentile
3) Variance
4) Standard Deviation

How well did you know this?

Not at all

Perfectly

Box and Whisker plot contains: (7)

1) High outlier
2) Maximum whisker
3) Upper quartile (Q3)
4) Median
5) Lower quartile (Q1)
6) Minimum whisker
7) Low Outlier

How well did you know this?

Not at all

Perfectly

Variance

S^2
is the sum of the squares of differences from the mean / degrees of freedom minus 1

S^2 = (difference)^2 + (difference)^2 / (n-1)

How well did you know this?

Not at all

Perfectly

Standard Deviations

square root of the sample variance
estimates average variation of n-values from the mean
tells us how much variability is expected among individuals

How well did you know this?

Not at all

Perfectly

Binary or dichotomous data

numbers or % in each category

yes/no style answers

How well did you know this?

Not at all

Perfectly

Nominal Data

number and % of subjects in each category

How well did you know this?

Not at all

Perfectly

Ordinal Data

Numbers and % of subjects in each category

median
smallest and largest
values/range

How well did you know this?

Not at all

Perfectly

Quantitative Data

Graph and visualize the distribution

mean/median/mode
smallest and largest
values/range
percentiles
variances
standard deviations

How well did you know this?

Not at all

Perfectly

How are confidence intervals derived and interpreted

Ex) 95% CI

The interval from ___ to ___ has a 95% chance (probability) to contain the true population mean
greater sample size = smaller CI

How well did you know this?

Not at all

Perfectly

Concept of hypothesis testing and steps (3)

Hypothesis testing involves comparison of groups
1) Test statistic (t-distribution, z-dist, f-dist)
2) P-value
3) p-value compared to alpha (0.05, 0.1, etc)

How well did you know this?

Not at all

Perfectly

Interpretation of p-values

Study These Flashcards

P-value is an indication of the “data occurring” if the null was true

p < alpha = reject Ho
p > alpha = Do not reject Ho

How are Chi-squared tests derived from 2x2 tables

Study These Flashcards

2x2 tables are expanded to show observed vs. expected numbers

Chi-squared uses differences in observed vs. expected numbers to calculate chi-squared statistic

Alternate hypothesis

Study These Flashcards

Ha: two groups are different

What does it mean when a CI for means/risk difference contains 0?

Study These Flashcards

Means it is non-significant

What does it mean when CI for odds ratio/risk ratio contains 1?

Study These Flashcards

Non-significant

What does it mean when CI for odds ratio/risk ratio contains 1?

Study These Flashcards

Non-significant

Parametric Tests (3)

Study These Flashcards

When data is normally distributed

1) T-test
2) ANOVA
3) Regression

Non-Parametric Tests (2)

Study These Flashcards

When data is not normally distributed

1) Wilcoxon rank test
2) Kruskal-wallis test

T-test

Study These Flashcards

Parametric test

Test whether the mean of a sample or population is different from a particular value
one sample or one group OR
two groups (2 sample t-test)

ANOVA

- Parametric test ( continuous normal distribution) - Test equality of the means between 2 or more populations - groups must have equal variance

Paired T-test

Same organism used for 2 or more observations - Parametric test - continuous and normally distributed - 2 comparison groups but one organism

Linear Regression

- outcome continuous, normally distributed - Parametric test - 1 or more predictors leads to 1 outcome Time = race distance + sex

Wilcoxon rank test

- Non-parametric data (continuous, not normally distributed) - test mean of sample/pop. is different to particular value - equivalent to 1 sample t-test, or 2 sample t-test - can also be used for paired samples/populations

Kruskal-Wallis test

Non-Parametric data (continuous, not normally distributed) - used to test equality of MEDIANS or two or more samples/populations - 2 or more comparison groups - Equivalent to ANOVA

Homoscedasticity

Equal variance

Logarithmic Regression

Not normally distributed outcome Outcome: Dichotomous/binary 1 or more predictor leads to outcome odds ratio converted to probability

Logarithmic Regression

Not normally distributed outcome Outcome: Dichotomous/binary 1 or more predictor leads to outcome odds ratio converted to probability (odds) heart disease (yes/no) = age (years) + family history (yes/no) + smoking (years)

Statistics/Stat Approaches Flashcards

(32 cards)