Test Development Flashcards
(63 cards)
what is an umbrella term for all that goes into the process of creating a test?
test development
what are the 5 stages of developing a test?
- Test Conceptualization
- Test Construction
- Test Tryout
- Item Analysis
- Test Revision
this stage of test development involves statistical procedures employed to assist in making judgments about which items are good as they are, which items need to be revised, and which items should be discarded
item analysis
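The card above does not name a specific statistic. One that is commonly computed at this stage is the item-difficulty index, the proportion of test-takers who answer an item correctly. The sketch below is illustrative only: the function name, the 0/1 data layout, and the sample scores are all assumptions, not part of the source.

```python
def item_difficulty(responses):
    """Proportion of test-takers who answered each item correctly.

    responses: one row per test-taker, each row a list of 0/1 item
    scores (hypothetical layout chosen for this sketch).
    """
    n_takers = len(responses)
    n_items = len(responses[0])
    return [sum(row[i] for row in responses) / n_takers for i in range(n_items)]


# Hypothetical scored responses: 4 test-takers, 3 items (1 = correct).
scores = [
    [1, 1, 0],
    [1, 0, 0],
    [1, 1, 0],
    [1, 0, 1],
]
difficulty = item_difficulty(scores)
print(difficulty)  # [1.0, 0.5, 0.25]
```

An item everyone passes (1.0) or almost no one passes tells little about differences between test-takers, which is one reason such items are often flagged for revision.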
this stage of test development entails writing test items (or rewriting or revising existing items), as well as formatting items, setting scoring rules, and otherwise designing and building a test
test construction
in this type of test, items should measure whether test-takers meet specific criteria, regardless of their position relative to others.
Success is defined by meeting set criteria, not by ranking.
criterion-referenced tests
this stage of test development refers to action taken to modify a test’s content or format for the purpose of improving the test’s effectiveness as a tool of measurement
test revision
in this type of test, items are deemed “good” if high scorers answer correctly and low scorers answer incorrectly.
norm-referenced tests
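The idea that a good item separates high scorers from low scorers can be expressed as a discrimination index, d = (U - L) / n. Here U and L are the numbers of correct answers in the top-scoring and bottom-scoring groups, and n is the size of each group. This is a minimal sketch: the function name, the group fraction, and the sample data are assumptions made for illustration.

```python
def discrimination_index(responses, item, fraction=0.27):
    """d = (U - L) / n for a single item.

    The 27% default group size is a common convention, assumed here.
    """
    # Rank test-takers by total score, highest first.
    ranked = sorted(responses, key=sum, reverse=True)
    n = max(1, round(len(ranked) * fraction))
    upper = sum(row[item] for row in ranked[:n])   # correct in top group
    lower = sum(row[item] for row in ranked[-n:])  # correct in bottom group
    return (upper - lower) / n


# Hypothetical 0/1 scores: 8 test-takers, 4 items.
scores = [
    [1, 1, 1, 0],
    [1, 1, 1, 1],
    [1, 0, 1, 0],
    [1, 1, 0, 0],
    [0, 1, 1, 0],
    [1, 0, 0, 1],
    [0, 0, 0, 1],
    [0, 0, 0, 0],
]
d_first = discrimination_index(scores, item=0, fraction=0.25)
d_last = discrimination_index(scores, item=3, fraction=0.25)
print(d_first, d_last)  # 1.0 0.0
```

A value near 1 means high scorers pass the item and low scorers fail it. A value near 0 (or negative) suggests the item does not discriminate in the intended direction.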
this term refers to the preliminary research and testing around the creation of a test prototype
pilot work
what is the purpose of pilot work/studies?
pilot studies help evaluate the potential test items to determine their suitability for the final version of the test
this term refers to the process of setting rules for assigning numbers or indices to measure different amounts of a trait, attribute, or characteristic.
scaling
what are the different types of scaling methods?
- Rating Scale
- Likert Scale
- Method of Paired Comparisons
- Comparative Scaling
- Categorical Scale
- Guttman Scale
- Thurstone Scale
this type of scale is a summative scale where test-takers rate the strength of a trait, attitude, or emotion
rating scale
this type of scale is commonly used in psychology to measure attitudes, providing response options on a continuum
respondents rate their level of agreement with a statement
likert scale
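A Likert item is usually scored by mapping each agreement option to a number and summing across items. The sketch below assumes a 5-point format and a hypothetical reverse-keying option for negatively worded statements. The labels, the function name, and the sample answers are illustrative, not taken from the source.

```python
# Assumed 5-point agreement continuum.
LIKERT_5 = {
    "strongly disagree": 1,
    "disagree": 2,
    "neither agree nor disagree": 3,
    "agree": 4,
    "strongly agree": 5,
}


def likert_total(answers, reverse_keyed=()):
    """Sum the numeric ratings, flipping any reverse-keyed items."""
    total = 0
    for index, label in enumerate(answers):
        value = LIKERT_5[label]
        if index in reverse_keyed:
            value = 6 - value  # 1 <-> 5, 2 <-> 4, 3 stays 3
        total += value
    return total


# Hypothetical respondent; the third statement is negatively worded.
answers = ["agree", "strongly agree", "disagree"]
total = likert_total(answers, reverse_keyed={2})
print(total)  # 13
```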
what does a unidimensional rating mean?
the scale only measures one underlying dimension
what does a multidimensional rating mean?
the scale measures multiple dimensions
this type of scale produces ordinal data by comparing stimuli
presents two items at a time, asking respondents to choose one based on a specific criterion
method of paired comparisons/paired comparison scale
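One simple way to turn paired-comparison choices into ordinal data is to count how often each stimulus is chosen across all pairs and rank by that count. This is a sketch under assumptions: the stimuli, the simulated judge, and the function name are hypothetical.

```python
from itertools import combinations


def rank_by_paired_comparisons(stimuli, choose):
    """Present every pair once and rank stimuli by times chosen."""
    wins = {s: 0 for s in stimuli}
    for a, b in combinations(stimuli, 2):
        wins[choose(a, b)] += 1
    ranking = sorted(stimuli, key=lambda s: wins[s], reverse=True)
    return ranking, wins


# Hypothetical judge asked "which behavior is more serious?"
severity = {"lying": 2, "stealing": 3, "littering": 1}


def judge(a, b):
    return a if severity[a] > severity[b] else b


ranking, wins = rank_by_paired_comparisons(list(severity), judge)
print(ranking)  # ['stealing', 'lying', 'littering']
```

The result orders the stimuli but says nothing about how far apart they are, which is why the data are ordinal.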
this type of scale involves judgement of stimuli in relation to others on the scale
rating an item relative to a benchmark or another item on the scale
comparative scaling
this term refers to the collection of potential test items that will be refined for the final test
item pool
this type of item format requires choosing an answer from given options
(e.g., multiple-choice, true-false)
selected-response format
what are the two types of item formats?
- Selected-Response Format
- Constructed-Response Format
this type of item includes a “stem,” a correct option, and distractors.
multiple-choice items
this type of item includes two possible responses, such as true/false
binary-choice item
this type of item involves matching premises with correct responses
matching items
this term refers to interactive testing where item selection depends on previous answers, reducing floor and ceiling effects
computerized adaptive testing (CAT)
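The selection logic can be sketched with a deliberately simplified rule: give the item whose difficulty is closest to the current ability estimate, then raise the estimate after a correct answer and lower it after an incorrect one. Operational CAT systems typically rely on item response theory instead. The item bank, step sizes, and function name below are assumptions for illustration only.

```python
def run_adaptive_test(item_difficulties, answer, n_items=3, start=0.0, step=1.0):
    """Simplified adaptive loop (not an IRT-based implementation)."""
    ability = start
    remaining = dict(item_difficulties)
    administered = []
    for _ in range(min(n_items, len(remaining))):
        # Pick the unused item closest in difficulty to the estimate.
        item = min(remaining, key=lambda name: abs(remaining[name] - ability))
        del remaining[item]
        correct = answer(item)
        administered.append((item, correct))
        # Move the estimate toward harder or easier items, in smaller steps.
        ability += step if correct else -step
        step /= 2
    return ability, administered


# Hypothetical item bank: item name -> difficulty.
bank = {"q1": -2.0, "q2": -1.0, "q3": 0.0, "q4": 1.0, "q5": 2.0}


# Simulated test-taker who can solve items up to difficulty 1.0.
def simulated_answer(item):
    return bank[item] <= 1.0


ability, administered = run_adaptive_test(bank, simulated_answer)
print(administered)  # [('q3', True), ('q4', True), ('q5', False)]
print(ability)       # 1.25
```

Because the test-taker is steered toward items near their level, very easy and very hard items are skipped. This is the mechanism behind the reduced floor and ceiling effects mentioned in the card.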