chapter 5 Flashcards
(62 cards)
- Is a synonym for dependability or consistency.
- Refers to consistency in measurement.
Reliability
Is an index of reliability, a proportion that indicates the ratio between the true score variance on a test and the total variance.
Reliability coefficient
That a score on an ability test is presumed
to reflect not only the testtaker’s true score on the ability being measured but also error.
Classical test theory
- Variance from true differences.
True variance
- A statistic useful in describing sources of test score variability.
- This statistic is useful because it can be broken into components.
- The standard deviation squared.
Variance
Refers to collectively, all of the factors associated
with the process of measuring some variable, other than the variable being measured.
Measurement error
Variance from irrelevant, random sources
Error variance
Is a source of error in measuring a targeted variable caused by unpredictable fluctuations and inconsistencies of other variables in the measurement process.
- Sometimes referred to as “noise,” this source of error fluctuates from one testing situation to
another with no discernible pattern that would systematically raise or lower scores.
Random error
Refers to a source of error in measuring a
variable that is typically constant or proportionate to what is presumed to be the true value of the variable being measured.
Systematic error
Sources of Error Variance:
Sources of error variance include test construction, administration, scoring, and/or
interpretation.
Terms refer to variation among items within a test as well as to variation among items between tests.
- Under test construction
Item sampling or content sampling
Sources of error variance that occur during test administration may influence the testtaker’s attention or motivation. The testtaker’s reactions to those influences are the source of one kind of error variance.
- Examples of untoward influences during
administration of a test include factors related to the: room temperature, level of lighting, and amount of ventilation and noise, for instance.
Test environment
- Other potential sources of error variance during test administration are: pressing emotional problems, physical discomfort, lack of sleep, and the effects of drugs or medication can all be sources of error variance.
Test-taker variables
The examiner’s physical appearance and demeanor—even the presence or absence of an examiner—are some factors for consideration here.
Examiner-related variables
- In many tests, the advent of computer scoring and a growing reliance on objective, computer-scorable items have virtually eliminated error variance caused by scorer differences.
- However, not all tests can be scored from grids blackened by no. 2 pencils. Individually administered intelligence tests, some tests of personality, tests of creativity, various behavioral measures, essay tests, portfolio assessment, situational behavior tests, and countless other tools of assessment still require scoring by trained personnel.
Test scoring and interpretation
- Surveys and polls are two tools of assessment commonly used by researchers who study public opinion.
- Certain types of assessment situations lend themselves to particular varieties of systematic
and nonsystematic error.
Other sources of error
Reliability Estimates:
- Test-Retest Reliability Estimates
- Parallel-Forms and Alternate-Forms Reliability Estimates
- Split-Half Reliability Estimates
Is an estimate of reliability obtained by correlating pairs of scores from the same people on two different administrations of the same test.
- Is appropriate when evaluating the reliability of a test that purports to measure something
that is relatively stable over time, such as a personality trait.
Test-retest reliability
When the interval between testing is greater than six months, the estimate of test-retest reliability is often referred to as
Coefficient of stability
The degree of the relationship between various forms of a test can be evaluated by means of an alternate-forms or parallel-forms coefficient of reliability, which is often termed the
Coefficient of equivalence
- of a test exist when, for each form of the test, the means and the variances of observed test scores are equal.
Parallel forms
Refers an estimate of the extent to which item sampling and other errors have affected test scores on versions of the same test when, for each form of the test, the means and variances of observed test scores are equal.
Parallel forms reliability
Are simply different versions of a test that
have been constructed so as to be parallel.
Alternate forms
Refers an estimate of the extent to which these different forms of the same
test have been affected by item sampling error, or other error.
Alternate forms reliability