week 8 Flashcards

1
Q

What is the idea behind classical test theory?

A

Observed score = true score + measurement error
X=T+E.

if scores on the same test correlate, even throughout different times, raters etc, then reliability is good.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What is standard error of measurement?

A

SEM indicates the level of measurement error associated with (an individual’s) test score. It is simply the SD of measurement error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

What are the five stages of test development?

A

Test conceptualisation, test construction, test-tryout, analysis, revision

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What is a dichotomous measure

A

Only two options, like yes/no, t/f

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What is a polytomous measure

A

2 or more alternative answers

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What is a forced choice format

A

When theres an even number of questons, with no middle ground (i.e. no ‘neither agree nor disagree.’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What does item difficulty mean in this context?

A

Applies to tests of achievement or ability, the % of people who get them correct. good tests have a large spread (HD, D, C, fail etc)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is Discriminability?

A

When you takr the top and bottom 33% of your group and see which ones they got right and wrong. Also known as a point biserial method. Correlation between performance no particular items (dichotomus) and performance on whole test (continuous)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the item characteristic curve?

A

A fucking irritating statistical thing.
So on a plot, the X axis is theta. This is the thing you are trying to measure. Y axis is probability of response to ‘Yes’ for item.
So, we look at this curve, and at p=.05, thats where people go from saying no to yes. the sharper the curve the more abrupt the transition.
‘probability of endorsement’

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

criterion-referenced tests

A

performance measured against pre-determined critera (driving tests)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

norm-referenced tests

A

performance measured relative to others (like intelligence)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly