test 2 notes Flashcards
(36 cards)
Reliability
Property or attribute of consistency in measurement for a test
Reliability Coefficient
Statistic that ranges from 0 to 1.0
Classical Test Theory
Variance of any score is due to “true” measurement plus error
True Score
Reflects a person’s true ability/attribute/trait that test is trying to measure
Measurement Error
Influence of any other variable that could change a “true” score, e.g., low test reliability
Error
Actual test score minus the true score.
Measurement Error Sources
Unsystematic errors and systematic errors
Unsystematic Errors
Not as serious, from random scores, item selection, test administration, and test scoring
Systematic Errors
More serious, poor content domain sampling, question on a social intro version test that actually measures general anxiety
Coefficient Alpha
Ranges from 0 to 1
Cronbach’s α
Mean of all possible split-half reliability coefficients for a given test. Shows correlation of scores with each other
Kuder-Richardson-20
Similar to α, but used when test responses are dichotomous (True/False)
Decisions About People
.90 to .95 Reliability Coefficient
Research Tests
.80 reliability coefficient
Tests that seem promising but need more development
.70 reliability coefficient
Relationship of Reliability to SEM
Lower SEM=Higher Reliability
Validity
A unitary concept that reflects the extent to which a test measures what it aims to measure
Utility
Inferences made from the test are appropriate, meaningful and useful
Construct
A complex psychological concept that cannot be directly measured. Happiness, depression, love.
Face Validity
Questions are clear and understood by
examinee to reflect what’s being tested
or measured. Example: BDI-II. However, face valid tests are susceptible
to response bias.
Content Validity
Degree to which the questions, tasks, or
items on a test are representative of the
universe of behavior the test was designed
to sample. Example: Beck Depression Inventory
Criterion Related Validity
when test is effective in estimating an examinee’s performance on some outcome measure
Criterion
a concrete real-world outcome (e.g., college acceptance, employment status, work produced, grade)
What are the two types of Criterion Validity?
Concurrent and Predictive