final Flashcards
face validity
does a test appear to measure what it was designed to measure; lay-person judgement
how do content & face validity differ?
content involves systematic and technical analysis
face is more superficial
criterion validity
the extent to which a measure agrees with a gold standard; whether it matches a measure of some attribute or outcome that is of primary interest (criterion)
types of studies: criterion validity
predictive vs concurrent
predictive studies
take the test today and test the criterion some time down the road
drawbacks to predictive validity studies
time, money, issues from time lag
concurrent studies
test and criterion done at the same time
when should you use predictive vs concurrent studies?
if goal is prediction -> predictive
if goal is to determine current status -> concurrent
criterion contamination
when criterion measures more dimensions than by those by test
do scores on the predictor influence criterion scores?
techniques for interpreting validity coefficients
(1) sig level; did not occur by chance (p value)
2) coefficient of determination (R^2
what if your validity coefficient is small?
if a test provides info that helps predict criterion performance better than any other existing predictor the test may be useful even if coefficient is relatively small
linear regression
a mathematical procedure that allows us to predict values on one variable if we know values on the other
standard error of estimate
a stat that reflects the average amount of error in our prediction and that allows us to make confidence statement
decision theory models
when tests are used for making decision such as personnel selection; factors others than the correlation between test and criterion are important
decision theory models: selection ratio
proportion of applicants needed to fill position
decision theory models: base rate
proportion of applicants who can be successful candidates
model sensitivity
metric that evaluates ability to predict true positives of each available category
A/A+C
A= true positive C= false negative
model specificity
metric that evaluates ability to predict true negatives of each available category
D/B+D
B= false positive
D=true negative
evaluating validity coefficients
- look for changes in the cause of relationships
- what does the criterion mean?
- review the subject pop of validation study
- be sure sample size was adequate
- never confuse criterion with predictor
- check for restricted range on both predictor and criterion
- review evidence for validity generalization
- consider differential prediction
construct validity
extent to which evidence can be provided that test measures a theoretical construct
Campbell & Fiske’s types of validity evidence
convergent and discriminant
types of convergent evidence
(1) does test measure same thing as other tests used for same purpose
(2) does test correlate with specific variables that we can expect if it is doing its job
validation study
two or more constructs measured in two or more ways
what can validation studies tell us?
convergent and discriminate validity
homogenity and unidimensionality