module 5 Flashcards
(36 cards)
data fishiness assumptions
- assumption of normality
- assumption of homogeneity of variance
- independence of observation
assumption of normality general definition
scores on the dependent variable within each group are assumed to be sampled from a normal distribution
NHST for evaluating normality general definition
- tests if sample distribution is sig different from normal distribution (same mean and SD)
what tests are used for NHST tests for assumption of normality
- shapiro wilkes test
- kolomogorov smirnov test
skew and kurtosis definition and cut offs
- skew: asymmetry of distribution (0=normal) for descriptive approach >2
- kurtosis: measure of how heavy/light distribution tails are (heavy=high kurtosis/many outliers, light=low kurtosis/no outliers) for descriptive approach >7
- for both, 1.96 or above is non normal
limitations of stat tests of normality
- big difference needed for small samples, small difference for large sample
- non-normality is less of a concern in small samples
- doesnt take type of non normality into account
descriptive approach for evaluating normality definition
- looks at descriptives and or graphic displays to quantify magnitude and nature of non-normality
____ kurtosis is more problematic than ____ kurtosis in t tests, ANOVAs, correlations, and regressions
positive, negative
which approach makes more sense for normality testing; NHST or descriptives
descriptives bc it combines threshold of values and qq plots
thin vs fat tails for normality distributions
- thin: fewer extreme observations than normal distributions
- fat: more extreme observations than normal distributions
if data is normal, scatterplot should resemble a _____
straight line (as opposed to cloud shape)
if the middle of the scatterplot line is straight and the ends flatten, it _____
indicates thin tails and is not problematic
if the middle of the scatterplot line is straight and the ends have a steep slope, it _____
indicates fat tails and is problematic
assumption of homogeneity of variance definition
variance of scores on dependent variable with in each group (condition) are the same across all groups (conditions)
evaluating homo of variance; NHST approach definition
- tests if variances in groups are significantly dif from one another
evaluating homo of variance; descriptive approach
- looks only at imperfection
- looks at descriptive stats and or graphic displays to quantify magnitude of differential variances (largest vs smallest SD)
- looks at threshold ratio of largest to smallest variances
tests for homo of variance
- levenes tests
- hertley variance ratio test or f-max tests
limitations of NHST approach for homo of variance
- role of sample size (dif in variance is less concern for small and more concern for larger sample sizes)
- insensitive to dif in variance in small and sensitive to big
- dif in variance is a magnitude problem
if variances are equal, scatterplot should resemble a straight line with a slope of ___ and the intercept is ____ whereas when the variances are not equal, scatterplot will not cluster around the line and will be different from __
1, the difference between means,1
independence of observation definition
- each observation (between subjects) or set of observations (repeated measures) from the dataset is independent of all other observations/sets
- ex of independance= roommates/partners
positive associations inflate ___ and negative associations inflate ___
alpha, beta
evaluating independence of observation
- examine structural properties of data to see if basis exists for questioning validity of assumption
- if no evident basis, its okay to carry on
- thresholds are up for debate
- if basis exists, independence can be assessed by computing interclass correlation for the part of data that is assumed to have lack of independence
- if correlation is very small (<0.10), its fine to use t test/ANOVA
address violation for normality
- use alt stat procedures that dont need normality
- evaluate level of measurement assumptions
- identity and remove outliers
- transform data to normalize distribution
address violations of homo of variance
- use alt procedures that dont need normality
- evaluate level of measurement assumptions
- identity and remove outliers