research stats midterm Flashcards
what is biostatistics?
the statistics of medicine, health sciences and public health
define target population
larger population to which results will need to be generalized
define accessible population
actual population of subjects available
define sample
subgroup of the accessible population that is actually studied, chosen so that results can be generalized
define parameter
statistical characteristic of population
define statistic
statistical characteristic of sample
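
A minimal sketch of the parameter vs. statistic distinction. The population (integers 1 to 100) and the way the sample is picked are made-up assumptions for illustration only.

```python
from statistics import mean

# Hypothetical accessible population: the integers 1..100.
population = list(range(1, 101))

# Hypothetical sample: every 10th value, starting at the 5th.
sample = population[4::10]

parameter = mean(population)  # describes the whole population
statistic = mean(sample)      # describes only the sample

print(parameter, statistic)   # 50.5 50
```

The statistic (50) estimates the parameter (50.5) without measuring every member of the population.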
define descriptive statistic
describes sample shape, central tendency, variability
define inferential statistic
used to make inferences about a population
define central tendency
the central value
best representative value of target population
single value
define variability
spread of the data
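
A rough sketch of two common ways to put a number on spread: the range and the standard deviation. The scores are invented, and the choice of these two measures is an assumption; the card itself only says "spread".

```python
from statistics import pstdev, stdev

scores = [2, 4, 4, 4, 5, 5, 7, 9]  # made-up data

data_range = max(scores) - min(scores)  # largest minus smallest value
population_sd = pstdev(scores)          # SD treating scores as a whole population
sample_sd = stdev(scores)               # SD treating scores as a sample (n - 1)

print(data_range, population_sd, round(sample_sd, 2))
```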
define frequency distribution
the pattern of frequencies of a variable
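
A small sketch of building a frequency distribution by counting how often each value occurs. The survey responses are hypothetical.

```python
from collections import Counter

# Hypothetical survey responses for one variable.
responses = ["agree", "neutral", "agree", "disagree", "agree", "neutral"]

# Counter maps each distinct value to its frequency.
frequencies = Counter(responses)

# List values from most to least frequent.
for value, count in frequencies.most_common():
    print(f"{value}: {count}")
```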
3 measures of central tendency
mean - average
median - middle value; splits the ordered data into two equal halves
mode - most frequent score
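
The three measures above can be sketched with Python's standard `statistics` module. The scores are made up for illustration.

```python
from statistics import mean, median, mode

scores = [2, 3, 3, 5, 7]  # made-up data, already in order

average = mean(scores)      # sum of values divided by how many there are
middle = median(scores)     # middle value of the ordered data
most_common = mode(scores)  # most frequent value

print(average, middle, most_common)  # 4 3 3
```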
describe skewed to the right
tail faces right
positive skew
mean > median/mode
describe skewed to the left
tail faces left
negative skew
mean < median/mode
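
A quick check of the skew rules above, using two invented data sets: one with a long right tail and one with a long left tail.

```python
from statistics import mean, median

right_skewed = [1, 2, 2, 3, 3, 3, 4, 20]        # tail toward high values
left_skewed = [1, 17, 18, 18, 18, 19, 19, 20]   # tail toward low values

# Positive skew: the tail pulls the mean above the median.
print(mean(right_skewed), median(right_skewed))  # 4.75 3.0

# Negative skew: the tail pulls the mean below the median.
print(mean(left_skewed), median(left_skewed))    # 16.25 18.0
```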
when is mean best to use?
numeric, symmetric data
not good for skewed
when is median best to use?
skewed data
not affected by extremes
when is mode best to use?
nominal or ordinal
common in surveys
advantages to mean
easy to calculate and interpret
don't need to put values in order
all values represented
can be used in further algebraic calculations
disadvantages to mean
can't be used with categorical data
can't be calculated if data are missing
affected by extremes
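
A minimal sketch of how one extreme value moves the mean far more than the median. The incomes (in thousands) are hypothetical.

```python
from statistics import mean, median

incomes = [30, 32, 35, 38, 40]   # hypothetical values
with_outlier = incomes + [400]   # same data plus one extreme value

print(mean(incomes), median(incomes))  # 35 35
print(round(mean(with_outlier), 1), median(with_outlier))  # 95.8 36.5
```

One extreme value nearly triples the mean, while the median barely moves.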
advantages to median
easy to calculate
not affected by extremes
can be used with ranked data
disadvantages to median
tedious in a large data set (values must be put in order first)
with an even number of observations there is no single middle value (average the two middle values)
doesn't account for all values
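
A short sketch of the even-count case, assuming the usual convention of averaging the two middle values. The data are invented; `median_low` and `median_high` are standard-library alternatives that return an actual data value instead.

```python
from statistics import median, median_high, median_low

values = [1, 3, 5, 7]  # even number of observations

averaged = median(values)     # mean of the two middle values (3 and 5)
lower = median_low(values)    # lower of the two middle values
upper = median_high(values)   # higher of the two middle values

print(averaged, lower, upper)  # 4.0 3 5
```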
advantages of mode
easy to understand and find
not affected by extremes
easy to ID in data set and in frequency distribution
mode is useful for categorical data
disadvantages of mode
not defined if no repeats
not based on all values
unstable when the data set has a small number of values
sometimes could have 2+ or no modes
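
A sketch of the "2+ or no modes" cases using `statistics.multimode` (Python 3.8+), which returns every value tied for the highest frequency. The data are made up.

```python
from statistics import multimode

bimodal = [1, 1, 2, 2, 3]   # two values tie for most frequent
no_repeats = [1, 2, 3]      # every value occurs exactly once

two_modes = multimode(bimodal)
all_tied = multimode(no_repeats)

print(two_modes)  # [1, 2]
print(all_tied)   # [1, 2, 3]
```

When nothing repeats, every value ties at a count of one, so the result singles out no value; this is the case the card calls "no mode".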
when would you choose median over mean?
distribution is skewed
researcher is using ordinal data