16 Putting it all together Flashcards

1
Q

What’s the difference between a valid measure and a reliable measure (target analogy)?

A

A valid measure always hits the target; a reliable measure always hits the same spot on the target.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Reliability is a ________ but not ________ condition for validity.

A

Reliability is a NECESSARY but not SUFFICIENT condition for validity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

At the conceptual level, a _____ measure is always reliable

A

At the conceptual level, a valid measure is always reliable.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Fuzzy reliability/validity distinction. Cronbach’s alphas can be an index of ______ ________ ________ or _________ ________ _______

A

Cronbach’s alphas can be an index internal consistency reliability or internal consistency validity (related to construct validity)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Fuzzy reliability/validity distinction. Parallel-forms reliability also assesses what kinds of validity?

A

Concurrent validity and to a degree convergence validity.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Assessment of inter-rater reliability is closely related to what form of validity?

A

Content validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Test-retest reliability can be used an index of __________ validity

A

Test-retest reliability can be used an index of external validity

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What happens to standard error of measurement when you increase sample size?

A

It gets smaller. SEM is inversely proportionate to the square root of sample size.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What’s the difference between Q (respondent-centred) and R (stimulus-centred) analysis?

A

Q (respondent-centred) analysis examines systematic variation across respondents –i.e. how responses differ between people.

R (stimulus-centred) analysis focuses on how responses differ between stimuli (test items).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What could be the problems caused by non-discriminating items?

A

Invariable responses, ceiling/floor effect.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are serial effects?

A

The effect of previously answered questions on subsequent questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What are homogenous and inversely keyed items?

A
Homogenous: 
"I go to parties" 
"I go to social gatherings"
Inversely keyed
"I tend to go to parties"
"I avoid parties"
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

Why do homogenous and inversely keyed items useful?

A

Because they detect invariable or random responses.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is generalisability theory?

A

A statistical framework developed by Cronbach for determining how well scores can be generalised to another setting. It attempts to understand all the sources of variation –called facets –(time, persons, raters, setting) and predict scores based on the variation of these facets.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What’s the main difference between generalisability theory and classical test theory?

A

Classical test theory has just one error term:
X = T + E
Generalisability theory allows for various sources of error which may vary independently.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What is Item Response Theory?

A

Item Response Theory is the study of test and item scores based on assumptions concerning the mathematical relationship between abilities (or other hypothesized traits) and item responses. It models the relationship between a respondent’s ability and the probability of getting a correct response on an item. I.e. smart – more likely to be correct. It can then be used to predict scores from one test to another, and for other purposes.