Lecture 4.2 Reliability Flashcards

Question

amount of time between administrations | Any interventions, treatment or trauma, taking place between test administrations;

Answer 1

Test-retest reliability will be affected by

Answer 2

Different versions of a test, matched for content and difficulty

Answer 3

Scores from one half of a test are correlated with the other half of the test, using equivalent halves • Random, odds & evens, content & difficulty

Answer 4

The degree of agreement between two or more scorers. Reduced by appropriate training.

Answer 5

correlate scores from 2 administrations of the same test

Answer 6

correlate scores from 2 versions of the same test

Answer 7

correlate scores from 2 equivalent halves of the same test

Answer 8

correlate items within the same test

Answer 9

correlate scores from 2 scorers for one test taker

Answer 10

Indicates the ratio between the true score variance on a test and the total variance Range from 0 to 1: closer to 1, the higher the reliability

Answer 11

__________________ test unifactorial, so consist of items measuring a single trait or factor

Answer 12

________________ test is multifactorial, so measure more than one trait or factor

Answer 13

a characteristic, trait, or ability that is presumed to be relatively unchanging

Answer 14

a characteristic, state, or ability that is presumed to be ever changing as a function of situational and cognitive experiences

Answer 15

sampling procedure used to gather the test scores does not result in a full spread of scores (e.g., having only university students complete an IQ test)

Answer 16

when the sample includes people who are outside of the range of the test so the scoring range is inflated (e.g., adults completing a test designed for children)

Answer 17

all items of equal difficulty, and time limited so that no-one is likely to be able to answer all items

Answer 18

time limit is long enough for all items to be attempted, but some items are so difficult that no-one is likely to get them all right

Answer 19

Designed to provide an indication of where a test taker stands with respect to some criterion (i.e., pass/fail type tests)

Answer 20

The extent to which evidence supports the meaning and use of a psychological test (or other assessment device)

Answer 21

A correlation coefficient that provides a measure of the relationship between test scores and scores on the criterion measure

Answer 22

How well a test or measurement tool measures what it purports to measure in a particular context

Answer 23

focuses on three categories of validity

Answer 24

Type of validity - scrutinizing the test’s content

Answer 25

Type of validity - relating scores obtained on the test to other test scores or other measures

Answer 26

Type of validity - ‘umbrella validity’; comprehensive analysis of how test scores relate to scores on other tests/measures & how test scores relate to the construct that the test was designed to measure

Answer 27

_____________ view takes everything into account, from implications of test scores in terms of societal values to the consequences of use

Answer 28

* The process of gathering and evaluating validity evidence. * Test developer is responsible for supplying validity info in the test manual and/or through a ‘test validation’ journal article

Answer 29

• Describes a judgement of how adequately a test samples behaviour representative of the universe of behaviour that the test was designed to sample

Answer 30

Type of content validity | A judgement concerning how relevant the test items appear to be to the test-taker

Answer 31

Important in employment settings, where tests are used to hire & promote • Tests must be shown to include relevant items in terms of job skills required for the position • Lawshe (1975): • Is the skill or knowledge measured by this item: 1) Essential; 2) Useful but not essential; 3) Not necessary to the performance of the job?

Answer 32

C____________ has an impact on judgements concerning the validity of tests and test items

Answer 33

C __________ R________ V __________ A judgement of how adequately a test score can be used to infer an individual’s most probable standing on some measure of interest – the measure of interest being the criterion

Answer 34

A _____________ is the standard against which a test or test score is evaluated -can be almost anything:

Answer 35

A criterion should be: 1. R___________ – pertinent or applicable to the matter at hand 2. V___________ for the purpose for which it is being used 3. U____________ – not based on predictor measures

Answer 36

P ______________ V ______________ is the degree to which a test score predicts a criterion measure at a future time

Answer 37

C___________ v_________ is the degree to which a test score is related to a criterion measure that is obtained at (about) the same time

Answer 38

I___________ V__________ The degree to which an additional predictor explains something about the criterion measure that is not explained by predictors already in use

Answer 39

test takers predicted not to show characteristic but do

Answer 40

test takers predicted to show characteristic but don’t

Answer 41

M_____ r_______the proportion of people incorrectly classified

Answer 42

H________ r_______the proportion of people correctly identified

Answer 43

B______ r________ the extent to which a particular trait, behaviour, characteristic or attribute exists in the population

Answer 44

C_________ v___________ A judgement about the appropriateness of inferences drawn from test scores regarding individual standings on a variable called a construct

Answer 45

``` Evidence of construct validity H_____________ of items Changes with a____ Pre-test to p_____________changes G________ differences C__________ evidence D__________ evidence F_________ analysis ```

Answer 46

E__________ of h___________ - How uniform the test is in measuring a single concept

Answer 47

Some constructs are expected to change with age, particularly during childhood/adolescence

Answer 48

Evidence that scores change as the result of some experience between a pre-test and a post-test can be evidence of construct validity

Answer 49

Demonstrating that scores on the test vary in a predictable way as a function of membership in some group

Answer 50

When test scores on a new test are found to correlate highly in the predicted direction with scores on a older, more established and validated test designed to measure the same construct

Answer 51

Shown when test scores are found to have little or no relationship with test scores or variables for which theoretically there should be no relationship

Answer 52

Can be used to determine both convergent and discriminant evidence of construct validity

Answer 53

A factor structure is explicitly hypothesised and is tested for its fit with the observed covariance structure of the measured variables

Answer 54

Estimating or extracting factors, deciding how many factors to retain, rotating factors to an interpretable orientation

Lecture 4.2 Reliability Flashcards

(78 cards)