Quiz 2 Flashcards
didn’t study, anxious, late to test
trait error
error that resides in the testing situation (noisy, hot, etc.)
method error
What is the formula for test score theory?
X (observed score) = T (true score) + E (error)
What is the formula for Gregory's intelligence scale?
(Lf + Ll)x100 divided by
10
___ are used to estimate reliability.
Correlation coefficients
ranges from -1.0 to +1.0
Correlation coefficients
is expressed from 0.0 (no reliability) to +1.0 (perfect reliability)
reliability coefficient
indicates the proportion of variance in a group of obtained scores that is attributable to true individual differences.
A reliability coefficient
A reliability coefficient is directly interpretable. In a test with a reliability of .90, ______ of the score variance is attributable to error.
.10
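A quick numeric check of the two cards above, as a sketch only. It assumes true scores and errors are uncorrelated, so observed variance is true variance plus error variance; the function name and the variance values are made up for illustration.

```python
def reliability_from_variances(true_var, error_var):
    """Proportion of observed-score variance due to true differences.

    Assumes X = T + E with T and E uncorrelated, so that
    observed variance = true variance + error variance.
    """
    observed_var = true_var + error_var
    return true_var / observed_var


# Hypothetical variances chosen so that reliability comes out at .90.
r = reliability_from_variances(true_var=9.0, error_var=1.0)
error_share = 1.0 - r  # the part of the variance attributable to error

print(round(r, 2), round(error_share, 2))
```

With these numbers the reliability is .90 and the remaining .10 of the variance is error.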
What are the 4 types of reliability?
test-retest
parallel form
internal consistency
interrater reliability
-Stability of test scores over time
-It requires two administrations of the same test with the same group of individuals
-Correlate scores from one test taken at two different times.
Describes what type of reliability?
test-retest reliability
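The "correlate scores from two administrations" step can be sketched as a Pearson correlation. The helper name `pearson_r` and the two score lists are invented for illustration; they are not from the course material.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of the same length (at least 2 scores)")
    mx, my = mean(x), mean(y)
    cross = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cross / sqrt(ss_x * ss_y)


# Same five people tested twice (made-up scores).
time1 = [10, 12, 15, 18, 20]
time2 = [11, 13, 14, 19, 21]

test_retest = pearson_r(time1, time2)
print(round(test_retest, 2))
```

A value close to +1.0 means people kept roughly the same rank order across the two testings.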
-Used when there is more than one form of a test
EX – SAT, ACT, GRE, MCAT, Make-up Exams
-Reduces the possibility of coaching or cheating and memory or practice effects (minimizes but does not eliminate)
-Measures the degree to which two forms of a test measure the same thing.
Describes what type of reliability?
Parallel Form Reliability
When you want to know if the items on a test assess one, and only one, dimension. The majority of psychological tests have only one form.
-Correlate each individual item with the total score
Describes what type of reliability?
internal consistency reliability
A single test is administered to a group of people
Items are divided into equal halves, typically odd-even (especially useful because many tests get harder as they progress)
Correlation between the two halves is the split-half reliability
Have to be careful with speed tests, especially if first items are easier
Split-half reliability
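A minimal sketch of the odd-even split described above. The response data and function names are hypothetical. Because each half is only half as long as the full test, the sketch also applies the Spearman-Brown correction, 2r / (1 + r), to estimate full-length reliability.

```python
from math import sqrt
from statistics import mean


def pearson_r(x, y):
    """Pearson correlation between two equal-length lists of scores."""
    mx, my = mean(x), mean(y)
    cross = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cross / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))


def split_half(responses):
    """Return (half-test correlation, Spearman-Brown corrected estimate)."""
    odd_totals = [sum(row[0::2]) for row in responses]   # items 1, 3, 5, ...
    even_totals = [sum(row[1::2]) for row in responses]  # items 2, 4, 6, ...
    r_half = pearson_r(odd_totals, even_totals)
    r_full = 2 * r_half / (1 + r_half)
    return r_half, r_full


# Made-up data: 5 people x 6 items, scored 1 (right) or 0 (wrong).
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

r_half, r_full = split_half(responses)
print(round(r_half, 2), round(r_full, 2))
```

The odd-even split puts easy and hard items in both halves, which is the point of the note about tests getting harder as they progress.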
When you want to know whether there is consistency in the rating of some outcome. Describes what type of reliability?
examine agreement between raters
interrater reliability
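One simple way to "examine agreement between raters" is percent agreement. The sketch below also computes Cohen's kappa, which adjusts for chance agreement; kappa is not mentioned on the cards and is added here as an assumption, and the ratings are made up.

```python
def percent_agreement(rater_a, rater_b):
    """Proportion of cases on which the two raters gave the same rating."""
    matches = sum(1 for a, b in zip(rater_a, rater_b) if a == b)
    return matches / len(rater_a)


def cohens_kappa(rater_a, rater_b):
    """Agreement corrected for chance (Cohen's kappa), for categorical ratings."""
    n = len(rater_a)
    observed = percent_agreement(rater_a, rater_b)
    categories = set(rater_a) | set(rater_b)
    # Chance agreement: product of each rater's marginal proportions, summed.
    expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )
    return (observed - expected) / (1 - expected)


# Two raters judging the same 10 cases (1 = behavior present, 0 = absent).
rater_a = [1, 1, 0, 1, 0, 1, 0, 0, 1, 1]
rater_b = [1, 1, 0, 0, 0, 1, 0, 1, 1, 1]

agreement = percent_agreement(rater_a, rater_b)
kappa = cohens_kappa(rater_a, rater_b)
print(round(agreement, 2), round(kappa, 2))
```

Kappa comes out lower than raw agreement because some matches would happen by chance alone.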
How do we increase reliability?
One way is to increase the number of items (this typically works), which also increases the range of test scores
EX – think of luck involved on a 5 item test vs. 100 item test.
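The 5-item versus 100-item comparison can be illustrated with the Spearman-Brown prophecy formula, r_new = k·r / (1 + (k − 1)·r), where k is how many times longer the test becomes. This is a sketch under the assumption that the added items are comparable to the existing ones; the starting reliability of .50 is invented.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability after making a test `length_factor` times as long.

    Assumes the new items are parallel to (as good as) the existing ones.
    """
    k = length_factor
    return (k * reliability) / (1 + (k - 1) * reliability)


short_test_r = 0.50  # hypothetical reliability of a 5-item test
long_test_r = spearman_brown(short_test_r, length_factor=100 / 5)

print(round(long_test_r, 2))
```

Under these assumptions, going from 5 to 100 items raises the predicted reliability from .50 to about .95, since luck on any single item matters much less.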
The average amount of variability in a set of scores (average distance from the mean) is called
standard deviation (s or sd)
If s = ____, there is no variability; all the scores are identical
0
_____ is sensitive to extreme scores, just like the mean.
Standard deviation
The standard deviation squared (equivalently, the SD computed without its final square-root step) is the ____
variance
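The four standard deviation cards above can be checked with Python's `statistics` module. The scores are made up, and the sketch uses the population versions (`pstdev`, `pvariance`, dividing by N); if the course uses the N − 1 formula, `stdev` and `variance` are the matching functions.

```python
from statistics import pstdev, pvariance

scores = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]  # made-up scores, mean = 5

sd = pstdev(scores)           # typical distance of scores from the mean
variance = pvariance(scores)  # the SD squared (no final square root)

# Identical scores have no variability at all.
identical_sd = pstdev([7.0, 7.0, 7.0, 7.0])

# One extreme score pulls the SD up sharply.
sd_with_outlier = pstdev(scores + [40.0])

print(sd, variance, identical_sd, sd_with_outlier > sd)
```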
Second measure of internal consistency takes it a step further than split-half
Split-half has been criticized for lack of precision – reliability changes based on how items are split
Why not take a more typical value such as the mean of the split-half coefficients for all possible splitting of a test?
Used with dichotomous data: items scored as right or wrong (1 or 0)
Kuder-Richardson Reliability
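A sketch of the Kuder-Richardson calculation using the common KR-20 form, (k / (k − 1)) · (1 − Σpq / variance of total scores), where p is the proportion answering an item correctly and q = 1 − p. The cards do not state which Kuder-Richardson formula is meant, so KR-20 is an assumption here, and the response data are invented.

```python
from statistics import pvariance


def kr20(responses):
    """KR-20 for a people x items table of 0/1 scores.

    Assumes total scores are not all identical (variance must be > 0).
    """
    n_people = len(responses)
    k = len(responses[0])
    # p = proportion of people who got each item right.
    p = [sum(row[i] for row in responses) / n_people for i in range(k)]
    sum_pq = sum(pi * (1 - pi) for pi in p)
    total_var = pvariance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum_pq / total_var)


# Made-up data: 5 people x 6 items, scored 1 (right) or 0 (wrong).
responses = [
    [1, 1, 1, 1, 1, 1],
    [1, 1, 1, 1, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [1, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0],
]

kr20_value = kr20(responses)
print(round(kr20_value, 2))
```

Unlike a single split-half coefficient, this value does not depend on how the items happen to be divided.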
Mean of all possible split-half coefficients, corrected by the Spearman-Brown formula.
Used for tests scored on a continuum, such as Likert scales
Must have a high reliability coefficient (items must be homogeneous, i.e., measure the same trait)
Cronbach's alpha (coefficient alpha)
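Coefficient alpha is usually computed directly rather than by averaging every possible split. A minimal sketch using the standard formula, (k / (k − 1)) · (1 − sum of item variances / variance of total scores); the Likert responses and the function name are made up for illustration.

```python
from statistics import pvariance


def cronbach_alpha(responses):
    """Coefficient alpha for a people x items table of numeric item scores."""
    k = len(responses[0])
    # Variance of each item (column) across people.
    item_vars = [pvariance([row[i] for row in responses]) for i in range(k)]
    # Variance of each person's total score.
    total_var = pvariance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Made-up data: 5 people x 4 Likert items (1 = strongly disagree ... 5 = strongly agree).
likert = [
    [5, 4, 5, 4],
    [4, 4, 4, 3],
    [3, 3, 3, 3],
    [2, 2, 3, 2],
    [1, 2, 1, 1],
]

alpha = cronbach_alpha(likert)
print(round(alpha, 2))
```

Alpha is high here because people who score high on one item tend to score high on the others, which is what homogeneous items look like.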
An advantage of the median over the mean is:
A) it is sensitive to extreme scores
B) it is less influenced by extreme scores
C) it more accurately reflects the central tendency
B) it is less influenced by extreme scores
The mean is:
A) the most frequently occurring score
B) the midpoint of a distribution of scores
C) least affected by extreme scores
D) the arithmetic average
D) the arithmetic average
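The last two cards can be checked by adding one extreme score to a small set of made-up scores and seeing how far the mean and the median each move.

```python
from statistics import mean, median

scores = [70, 72, 75, 78, 80]   # made-up test scores
with_extreme = scores + [200]   # same scores plus one extreme value

mean_shift = mean(with_extreme) - mean(scores)
median_shift = median(with_extreme) - median(scores)

print(round(mean_shift, 2), median_shift)
```

The mean, being the arithmetic average, moves by more than 20 points, while the median moves by only 1.5.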