CH8: Test Development Flashcards
(69 cards)
what is an umbrella term for all that goes into the process of creating a test?
test development
what are the 5 stages of developing a test?
- Test Conceptualization
- Test Construction
- Test Tryout
- Item Analysis
- Test Revision
this stage of test development entails writing test items (or rewriting or revising existing items), as well as formatting items, setting scoring rules, and otherwise designing and building a test
Test Construction
this stage of test development involves statistical procedures employed to assist in making judgments about which items are good as they are, which items need to be revised, and which items should be discarded
item analysis
in this type of test, items should measure whether test-takers meet specific criteria, regardless of their position relative to others.
Success is defined by meeting set criteria, not by ranking.
criterion-referenced tests
this stage of test development refers to action taken to modify a test’s content or format for the purpose of improving the test’s effectiveness as a tool of measurement
test revision
what are the key considerations when developing a test?
- Purpose
- Need and Users
- Content and Administration
- Test Forms
- Training
- Responses and Impact
- Score Interpretation
in this type of test, items are deemed “good” if high scorers answer correctly and low scorers answer incorrectly.
norm-referenced tests
this term refers to the preliminary research and testing around the creation of a test prototype
pilot work
what is the purpose of pilot work/studies?
pilot studies help evaluate the potential test items to determine their suitability for the final version of the test
this term refers to the process of setting rules for assigning numbers or indices to measure different amounts of a trait, attribute, or characteristic.
scaling
what are the different types of scaling methods?
- Rating Scale
- Likert Scale
- Method of Paired Comparisons
- Comparative Scaling
- Guttman Scale
- Direct Estimation
- Thurstone Scale
this type of scale is a summative scale where test-takers rate the strength of a trait, attitude, or emotion
rating scale
this type of scale is commonly used in psychology for attitudes, providing options on a continuum
likert scale
what does a unidimensional rating mean?
the scale only measures one underlying dimension
what does a multidimensional rating mean?
the scale measures multiple dimensions
this type of scale produces ordinal data by comparing stimuli
method of paired comparisons
this type of scale involves methods like equal-appearing intervals, where responses are directly rated without conversion
Direct Estimation
this type of scale involves judgement of stimuli in relation to others on the scale
comparative scaling
this type of scale yields ordinal-level measurements, using scalogram analysis to map responses
Guttman Scale
what are the preliminary questions you should think about when writing items?
- What content areas should be covered?
- Which item formats are best?
- How many items are needed overall and per content area?
this term refers to the collection of potential test items that will be refined for the final test.
Item Pool
what are the two types of item formats?
- Selected-Response Format
- Constructed-Response Format
this type of item-format requires choosing an answer from given options
(e.g., multiple-choice, true-false)
selected-response format