Week 10: Measurement Error Flashcards

(33 cards)

1
Q

Measurement error

A

The difference between the observed score and the true score

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

List some sources of measurement error

A
  • the test, if it is poorly designed
  • the test taker and how they are feeling on the day
  • the testing environment
  • cultural factors e.g. language understanding
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Standard error of measurement

A

Provides the necessary information for assessing the measurement error associated with an individuals test performance. How far the mean from a sample is from the true mean of the population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

List in order the test development process

A
  • conceptualisation
  • construction
  • tryout
  • analysis
  • revision
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Conceptualisation

A

How do you define a test? What are you measuring? What is useful and what is outside the bounds of this?

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Construction

A

Essentially developing the items, and considering why items are important for the test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Tryout

A

Collecting test data and qualitative feedback on the test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

How is best to measure knowledge

A

Yes/no, multiple choice, short answer

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

How is best to measure degree of symptoms/attitudes

A

Likert scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

How is best to measure health

A

Visual analogue scale

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

How is best to measure neurological function?

A
  • visual items
  • memory
  • visuospatial tasks
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

How is best to measure personality?

A

Q-sort

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

List some important considerations when designing a test

A
  • generate an item pool
  • short and concise items
  • age appropriate level
  • no double barrelled items
  • include negatively worded items
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

Dichotomous

A

Two answers; Yes/no, true/false

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Polytomous

A

More than 2 alternatives, like a multiple choice

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

Likert

A

Typically 5 items ranging from strongly disagree to strongly agree

17
Q

Likert type format

A

“Never”, “rarely”, etc.

18
Q

Q-sort

A

Cards with descriptions that need to be sorted via categories. Can be useful for clients but not for large samples.

19
Q

Describe semantic differential

A

Where a person falls between two opposing poles by marking a line

20
Q

Describe visual analogue

A

Indicating where you fall on a line

21
Q

Benefits of reverse scored items

A

Bias is controlled for and requires extra concentration from participants

22
Q

Category format

A

e.g. ‘On a scale from 1-10..’

23
Q

Some characteristics of category format

A
  • has situational effects
  • need clearly defined endpoints
  • increasing categories above 10 can decrease reliability
24
Q

Classical test theory

A
  • true score = observed score + error

- interested in total scores, of which one proportion is error and one proportion is true scores

25
Item response theory
More about individual items and focuses on looking for a characteristic curve.
26
The item characteristic curve
Describes the relationship between latent ability and performance on a test item
27
What will the item characteristic curve ideally show
Someone who has a high ability should have a higher chance of doing well than someone who has low ability
28
Describe criterion reference tests
Look at how well someone performs relative to others. Having a norm referenced test as the one we use most commonly.
29
List the various response item formats
- dichotomous - polytomous - MC - likert - likert type - Q sort - true false - two choice - forced choice
30
Describe item difficulty in terms of item analysis
The % of people who get an item correct informs the difficulty level, for example if 69% of individuals get it right, the difficulty level for that item is set at .69
31
A good test...
- discriminates at many levels | - has varying degree of difficulty
32
Extreme group method
The top 33% is compared to the bottom 33%, subtracting the proportion of items correct. The top proportion should have more items correct than the bottom.
33
Point biserial method
Correlation between performance on particular items and performance on whole test