PSY Exam 2 Flashcards

(32 cards)

1
Q

What is Reliability?

A

the degree to which test scores are free from errors of measurement (consistency)

2
Q

What is Observed Test Score?

A

true score plus error (X = T + E)

3
Q

What is the Classical Test Theory?

A

No instrument is perfectly reliable or consistent and all test scores contain some error

4
Q

What is Measurement Error?

A

variations in measurement using a reliable instrument

5
Q

What is a Reliable Test?

A

A test we can trust to measure each person in approximately the same way every time it is used

6
Q

What are the two types of error?

A

Random and Systematic

7
Q

What is Random Error?

A

error that varies unpredictably from one testing to the next and lowers the reliability of a test (over an infinite number of testings, random errors would average out to zero)

8
Q

What is Systematic Error?

A

error that occurs when a source of error consistently changes the true score in the same way; it does not lower the reliability of a test (the test is reliably inaccurate), but it does affect descriptive statistics such as the mean

9
Q

What is the Reliability Coefficient?

A

an index of the strength of the relationship between two sets of scores (think of the Spearman-Brown formula)
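
The Spearman-Brown formula mentioned here predicts how reliability changes when a test is lengthened or shortened. A minimal sketch (the function name is mine, not from the course):

```python
def spearman_brown(r, k):
    """Predicted reliability when test length is multiplied by factor k.

    r: reliability of the original test
    k: factor by which the number of items changes (k=2 doubles the test)
    """
    return (k * r) / (1 + (k - 1) * r)

# Doubling a test whose reliability is 0.70:
print(round(spearman_brown(0.70, 2), 3))  # 0.824
```

Note that adding items raises predicted reliability, which is why test length is listed later as a factor influencing reliability.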

10
Q

What are the 3 categories of Reliability Coefficients?

A

Test-Retest, Alternate Forms, Internal Consistency

11
Q

What is Test-Retest reliability?

A

The same test is administered at two different points in time and the scores are compared (looking for a correlation between them), while facing the challenges of practice effects and fatigue

12
Q

What is Alternate Forms reliability?

A

A test developer creates two (or more) different forms of the test (order effects & parallel forms)

13
Q

What is Internal Consistency reliability?

A

A measure of how related items or groups of items on a test are to each other (appropriate for homogeneous tests)

14
Q

What is a Homogeneous Test?

A

a test that is measuring only one trait or characteristic

15
Q

What is a Heterogeneous Test?

A

a test that is measuring more than one trait or characteristic

16
Q

What is Scorer Reliability and Agreement?

A

the amount of consistency among scorers’ judgments (two or more individuals score the same test)

17
Q

What is interscorer agreement?

A

the amount of consistency among scorers’ judgments

18
Q

What is intrascorer reliability?

A

whether a person is consistent in the way they assigned scores from test to test

19
Q

What is Interrater agreement?

A

an index of how consistently the scorers rate or make decisions

20
Q

What is Intrarater agreement?

A

when one scorer makes consistent judgments across all tests

21
Q

What is KR-20 used for?

A

Estimating the internal consistency of tests whose questions can be scored either right or wrong (ex: true or false)
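
The KR-20 formula can be sketched from its standard definition: (k / (k − 1)) × (1 − Σpq / variance of total scores), where p is the proportion answering each item correctly and q = 1 − p. A minimal sketch using population variance (the function name and sample data are mine):

```python
def kr20(item_scores):
    """Kuder-Richardson 20 for dichotomously scored (0/1) items.

    item_scores: list of rows, one per examinee, each a list of 0/1 item scores.
    """
    k = len(item_scores[0])                      # number of items
    n = len(item_scores)                         # number of examinees
    totals = [sum(row) for row in item_scores]   # each examinee's total score
    mean = sum(totals) / n
    var = sum((t - mean) ** 2 for t in totals) / n   # total-score variance
    pq = 0.0
    for i in range(k):
        p = sum(row[i] for row in item_scores) / n   # proportion passing item i
        pq += p * (1 - p)
    return (k / (k - 1)) * (1 - pq / var)

# 4 examinees, 3 true/false items (1 = correct, 0 = incorrect):
rows = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
print(round(kr20(rows), 2))  # 0.75
```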

22
Q

What is coefficient alpha used for?

A

Estimating the internal consistency of tests whose questions have more than two possible answers (ex: rating scales)
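
Coefficient alpha generalizes KR-20 to items with any numeric scoring: (k / (k − 1)) × (1 − sum of item variances / total-score variance). A minimal sketch using population variance (the function name and rating data are mine):

```python
def cronbach_alpha(item_scores):
    """Coefficient (Cronbach's) alpha for numerically scored items.

    item_scores: list of rows, one per examinee, each a list of item scores.
    """
    k = len(item_scores[0])                      # number of items
    n = len(item_scores)                         # number of examinees

    def variance(values):                        # population variance
        m = sum(values) / len(values)
        return sum((v - m) ** 2 for v in values) / len(values)

    total_var = variance([sum(row) for row in item_scores])
    item_var = sum(variance([row[i] for row in item_scores]) for i in range(k))
    return (k / (k - 1)) * (1 - item_var / total_var)

# 4 examinees rating 3 items on a 1-5 scale:
ratings = [[3, 4, 3], [2, 3, 4], [4, 5, 4], [1, 2, 2]]
print(round(cronbach_alpha(ratings), 2))  # 0.92
```

For 0/1 data, coefficient alpha reduces to KR-20 exactly.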

23
Q

What is Cohen’s Kappa?

A

an index for calculating scorer reliability / inter-rater agreement when scorers make judgments that result in nominal and ordinal data (-1.0 to 1.0)
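
Kappa corrects observed agreement for the agreement expected by chance: (observed − expected) / (1 − expected). A minimal sketch for two raters (the function name and example judgments are mine):

```python
from collections import Counter

def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters' categorical judgments on the same items."""
    n = len(rater_a)
    # Proportion of items on which the raters agree
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement, from each rater's marginal category proportions
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / n ** 2
    return (observed - expected) / (1 - expected)

a = ["pass", "pass", "fail", "pass", "fail"]
b = ["pass", "fail", "fail", "pass", "fail"]
print(round(cohens_kappa(a, b), 2))  # 0.62
```

A kappa of 1.0 means perfect agreement, 0 means agreement no better than chance, and negative values mean worse than chance.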

24
Q

What is standard error of measurement?

A

an index of the amount of uncertainty or error expected in an individual’s observed test score; in other words, how much the individual’s observed score might differ from the individual’s true score

25
What is the standard error of measurement formula?
SEM = SD * √(1 - r) | standard error of measurement = standard deviation multiplied by the square root of 1 minus the reliability coefficient of the test
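
The formula above translates directly to code; a minimal sketch (the function name and example values are mine):

```python
import math

def sem(sd, reliability):
    """Standard error of measurement: SD * sqrt(1 - r)."""
    return sd * math.sqrt(1 - reliability)

# A test with standard deviation 15 and reliability coefficient 0.91:
print(round(sem(15, 0.91), 1))  # 4.5
```

Note that a perfectly reliable test (r = 1) would have SEM = 0, and SEM grows as reliability drops.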
26
What is a confidence interval?
the range of scores that we feel confident will include the test taker's true score
±1 SD → 68%
±2 SD → 95%
±3 SD → 99.7%
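
In applied scoring, a confidence interval around an observed score is typically built from the SEM (previous card) rather than the raw SD. A sketch assuming normally distributed error (the function name and example values are mine):

```python
import math

def confidence_interval(observed, sd, reliability, z=1.96):
    """Range expected to contain the true score (z=1.96 gives ~95%)."""
    error = z * sd * math.sqrt(1 - reliability)   # z * SEM
    return observed - error, observed + error

# Observed score 100 on a test with SD = 15 and reliability 0.91:
lo, hi = confidence_interval(100, 15, 0.91)
print(round(lo, 1), round(hi, 1))  # 91.2 108.8
```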
27
What are the four factors that influence reliability?
test itself, test administration, test scoring, and test takers
28
What are the 6 factors related to the four sources of test error?
Test length, homogeneity of test questions, test-retest interval, test administration, scoring, and cooperation of test takers
29
What is the Generalizability Theory?
an approach for estimating reliability that is concerned with how well, and under what conditions, we can generalize an estimate of the reliability of test scores from one test administration to another
30
What did Dale say about G Theory?
it is a way of saying: we know that reliability involves error (systematic/random), but there are many different sources of error, and it would be nice to break those sources apart
31
What is validity?
the extent to which a test accurately measures what it is intended to measure