test 2 notes Flashcards by jacob m

Reliability

Property or attribute of consistency in measurement for a test

How well did you know this?

Not at all

Perfectly

Reliability Coefficient

Statistic that ranges from 0 to 1.0

How well did you know this?

Not at all

Perfectly

Classical Test Theory

Variance of any score is due to “true” measurement plus error

How well did you know this?

Not at all

Perfectly

True Score

Reflects a person’s true ability/attribute/trait that test is trying to measure

How well did you know this?

Not at all

Perfectly

Measurement Error

Influence of any other variable that could change a “true” score, e.g., low test reliability

How well did you know this?

Not at all

Perfectly

Error

Actual test score minus the true score.

How well did you know this?

Not at all

Perfectly

Measurement Error Sources

Unsystematic errors and systematic errors

How well did you know this?

Not at all

Perfectly

Unsystematic Errors

Not as serious, from random scores, item selection, test administration, and test scoring

How well did you know this?

Not at all

Perfectly

Systematic Errors

More serious, poor content domain sampling, question on a social intro version test that actually measures general anxiety

How well did you know this?

Not at all

Perfectly

Coefficient Alpha

Ranges from 0 to 1

How well did you know this?

Not at all

Perfectly

Cronbach’s α

Mean of all possible split-half reliability coefficients for a given test. Shows correlation of scores with each other

How well did you know this?

Not at all

Perfectly

Kuder-Richardson-20

Similar to α, but used when test responses are dichotomous (True/False)

How well did you know this?

Not at all

Perfectly

Decisions About People

.90 to .95 Reliability Coefficient

How well did you know this?

Not at all

Perfectly

Research Tests

.80 reliability coefficient

How well did you know this?

Not at all

Perfectly

Tests that seem promising but need more development

.70 reliability coefficient

How well did you know this?

Not at all

Perfectly

Relationship of Reliability to SEM

Study These Flashcards

Lower SEM=Higher Reliability

Validity

Study These Flashcards

A unitary concept that reflects the extent to which a test measures what it aims to measure

Utility

Study These Flashcards

Inferences made from the test are appropriate, meaningful and useful

Construct

Study These Flashcards

A complex psychological concept that cannot be directly measured. Happiness, depression, love.

Face Validity

Study These Flashcards

Questions are clear and understood by
examinee to reflect what’s being tested
or measured. Example: BDI-II. However, face valid tests are susceptible
to response bias.

Content Validity

Study These Flashcards

Degree to which the questions, tasks, or
items on a test are representative of the
universe of behavior the test was designed
to sample. Example: Beck Depression Inventory

Criterion Related Validity

Study These Flashcards

when test is effective in estimating an examinee’s performance on some outcome measure

Criterion

Study These Flashcards

a concrete real-world outcome (e.g., college acceptance, employment status, work produced, grade)

What are the two types of Criterion Validity?

Study These Flashcards

Concurrent and Predictive

Concurrent Validity

criterion is assessed at the SAME TIME as the measure

Predictive Validity

Criterion is assessed some time AFTER the measure

Convergent Validity

Test does indeed correlate with other similar tests or variables as it should

Discriminant (Divergent) Validity

Test does not correlate with other tests or variables as it should not

Beck Depression Inventory (BDI-II)

Cognitive-Affective Factor Guilt, self-criticism, pessimism Attention and concentration problems Loss of interest in enjoyable activities Somatic Factors Tired everyday, loss of energy/motivation Eating and sleeping problems Difficulty completing everyday tasks

Rapport

Testing environment of mutual respect and understanding is crucial to good test scores. Low rapport can cause anxiety, hostility

Examiner Factos

Sex, Race, Experience. Less important than rapport

Rosenthal Effect (Pygmalion Effect)

Lofty expectations of Examiner leads to improved examinee scores Low expectations result in lower scores Expectancy very subtle/unintentional Tester/examiner may subtly convey expectancy to examinee

Examinee Motivation (Response Bias)

Test scores are unreliable if examinee willingly and purposely alters his or her responses during testing

Stereotype Threat

Threat of conforming to a negative stereotype about one's group

Yerkes-Dodson Law (1908)

Principle that moderate levels of arousal leads to optimal level of arousal

test 2 notes Flashcards

(36 cards)