test theory & practice Flashcards

(78 cards)

1
Q

maximum performance test

A

asks the person to do his her best to solve one or more problems, intelligence and achievement tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

typical performance test

A

asks the person to respond to one or more tasks, where the responses are typical for the person. personality or attitude tests

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

dimensionality of a test

A

is equal to the number of latent attributes (variables) which effects test performance

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

latent

A

unobserved

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

unidimensional test

A

test that measures one latent attribute

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

multidimensional test

A

test that measures more than one latent attribute

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

mental test

A

consists of cognitive tasks, such as problems and questions

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

physical test

A

consists of instruments to make somatic or physiological measurements

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

a pure power test

A

consists of problems that the test taker tries to solve, test taker has ample time to work on each of the test items . emphasis is on measuring the accuracy to solve the problems

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

ability test

A

sometimes also called aptitude test, is an instrument for measuring a persons best performance in an area that is not explicitly taught in training or educational programs.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

achievement test

A

an instrument for measuring performance that is explicitly taught in training and educational programs

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

other evaluation mode

A

to ask others to evaluate a persons ability to perform a task

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

description

A

means the the test is only used to describe performances.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

dichotomous scale

A

where test takers responses are graded in two ordered categories, that is correct or incorrect

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

the ordinance polytomous scale

A

where test takers responses are graded in more than two ordered categories, for ex. a correct, partly correct or incorrect answer (or a b c d e )

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

a measurement procedure is reactive when

A

test takers can deliberately distort their construct value, for ex an unmotivated student who pretends to be highly motivated in a self report school attitude test

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
17
Q

measurement procedure is nonreactive when

A

test takers cannot distort their construct value, for ex drivers whose record shows many traffic offences cannot disguise that their record is indicative of a negative attitude towards traffic safety

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
18
Q

relational method

A

uses a loose description of the construct that is based on the knowledge of experts or members of the target population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
19
Q

propotypical method

A

asks members of the target population to think of persons having the construct and to write down their behaviour that is typical of the construct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
20
Q

internal method

A

starts with a broad pool of personality or attitude items

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
21
Q

external method

A

starts from a broad pool of items and a criterion that has to be predicted (success in job)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
22
Q

construct method

A

starts from an explicit theory, and items are derived from that theory

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
23
Q

facet design method

A

does not use an explicit theory, but it starts from a conceptual analysis of the construct

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
24
Q

behavioural facets

A

classify types of behaviour

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
25
situational facets
classified's the situations where the behaviours appear
26
endorsement response scale
asks the test taker to indicate his her degree of endorsement of the statement
27
dichotomous scale
scale that has only two categories
28
ordinal polytomous
a scale that has more than two ordered categories
29
bounded continuous scale
scale has two end points (bounds)
30
indicative item
item where a high frequency or endorsement response indicates a high level of the construct
31
contra indicative item
is an item where a high frequency or endorsement response indicates a low level of the construct
32
concurrent interview
asks the test taker to think aloud while answering the items
33
retrospective interview
asks the test takers to recollect their thinking after completing the items
34
coefficient of identity
can be applied to assess integrater agreement and intrarater consistency
35
response style
the differential use of the item response scale by different persons, important response styles are ; acquiescence, dissentience, extremity and midpoint response styles
36
acquiescence
is the tendency to agree with an endorsement statement, independently of the content of the statement (yea saying)
37
dissentience
tendency to disagree with an endorsement statement independently of the content of the statement (nay saying)
38
extremity response style
is the tendency to choose extremes of the item response scale
39
midpoint response style
tendency to choose the middle of the response scale
40
social desirability
persons tendency to deceive either oneself or others
41
self deception
tendency to deceive oneself
42
impression management
tendency to deceive others by making a good or bad impression or others
43
content validation
experts evaluate whether the test adequately covers all aspects of the construct
44
observed test score
is computed after the separate test items are scored
45
observed test score
derived from the item scores by taking the unweighted or weighted sum of the item scores
46
latent variable score
is derived from the item responses under the assumption of a latent variable item response model
47
item non-response
means that a test taker did not respond to some of the items of the test
48
imputed item score
score that is substituted for a missing item response
49
estimate of parameter
a value derived from empirical data
50
measurement precision information
applies to the test score of a single person, information is the within person aspect of measurement precision
51
measurement precision reliability
concerns the differentiation of test scores of different test takers from a population
52
true test score
the expected value of the observed test scores of the repeated test administrations in the thought experiment
53
error of measurement
the difference between test taker observed test score and his true score from an abritrary measurement occasion
54
a small amount of information (large within person error variance) means
that test taker observed test score vary widely around his true score across repeated test administrations
55
a large amount of information (small within person error variance) means
that test takers observed test scores do not vary widely around his true score
56
parallel tests
tests that measure the same true score with equal within person error variance and uncorrected errors across hypothetical repeated test administrations for each of the test takers of a population
57
classical test theory
is based on the definitions of test taker j's true score, his error of measurement, and the generalisation to a randomly selected person from a population of persons
58
standard error or measurement of a test
is the square root of the error variance in the population of persons
59
the importance of a lower bound is
that a high value of lower bound implies that the theoretical reliability is high (Cronbach's coefficient alpha)
60
location of the item score distribution is
the place of the scale where item scores are centered
61
dispersion
scatter of the item scores
62
shape
form of the distributions
63
classical item difficutly
of a maximum performance item is a parameter that indicates the location fo the item score distribution in a population of persons
64
classical item attractiveness
of a typical performance item is a parameter that indicates the location of the item score distribution in a population of persons
65
item p value
the mean of a dichotomously scored item
66
classical item discrimination
parameter that indicates the extent to which the item differentiates between the true test scores of a population of persons
67
the item-rest correlation
is the product moment correlation between the item scores and the rest scores of the test, where the studied item is deleted
68
popularity of a distractor of a multiple choice item
is the proportion of persons of a population who selected the distractor
69
item distractor-rest correlations
are the product moment of correlations of the separate dichotomous correct answer/distractor variables and the rest scores
70
the best thing to look at when you want to make a statement about how well an item discriminates between persons
at the item rest correlation
71
the difference between a correlation and a covariance
the covariance depends on the measurement scale of the variables. the correlation doesn't
72
which measurements are on the same measurement scale
variance and covariance
73
reliability is
that part of the variance that is the true-score variance
74
the reliability increases as the
measurement error variance decreases
75
the reliability of a test increases as
the covariances between the items increase
76
what are the requirements that parallel tests would need to fulfil?
the parallel tests must have the same within person error variance for each of the test takers of the population, the errors of measurements of parallel tests must nog be correlated across repeated test administrations, the parallel tests must measure the same true score for each of the test takers of the population
77
reliability
between persons aspect of measurement precision and informs us about the consistency of the test
78
validity
whether a reliable test measures consistently what it is supposed to measure