Assessment vocabulary Flashcards
(15 cards)
Norm-referenced
tests include the SAT, IQ tests, and tests that are graded on a curve. Anytime a test offers a percentile rank, it is a norm-referenced test.
Criterion-referenced
tests include Advanced Placement exams A criterion-referenced test is a style of test that uses test scores to generate a statement about the behavior that can be expected of a person with that score
Mean
The mean (average) of a data set is found by adding all numbers in the data set and then dividing by the number of values in the set.
Median
the middle score of a list of scores smallest to largest
Stanine
statistical units from a scale of 1 to 9 which is used to indicate a performance level
Percentile
Percentile in statistics, the percentile rank of a score is the percentage of scores in its frequency distribution that are less than that score
normal curve equivalent - or NCE,
a way of measuring where a student falls along the normal curve. The numbers on the NCE line run from 0 to 100, similar to percentile ranks, which indicate an individual student’s rank, or how many students out of a hundred had a lower score.
standard deviation
standard deviation - how spread out the data is. It is a measure of how far each observed value is from the mean. In any distribution, about 95% of values will be within 2 standard deviations of the mean
standard error of measurement
(SEm) is a measure of how much measured test scores are spread around a “true” score. The SEm is especially meaningful to a test taker because it applies to a single score and it uses the same units as the test.
P-value - A p-value i
a measure of the probability that an observed difference could have occurred just by random chance.
The lower the p-value, the greater the statistical significance of the observed difference.
P-value can be used as an alternative to or in addition to pre-selected confidence levels for hypothesis testing.
scaled score - A scaled score is
A representation of the total number of correct questions a candidate has answered (raw score) that has been converted onto a consistent and standardized scale.
item response theory
It is widely used in education to calibrate and evaluate items in tests, questionnaires, and other instruments and to score subjects on their abilities, attitudes, or other latent traits.
Grade-equivalent
An age or grade equivalent is simply the median raw score for a grade level. Because the acquisition of skills measured by an instrument such as a vocabulary test occurs more rapidly during early ages, raw scores increase at a greater rate with younger examinees than with older examinees.
item discrimination
the ability of an item to differentiate among students based on how well they know the material being tested
A standardized test is
any form of test that (1) requires all test takers to answer the same questions, or a selection of questions from a common bank of questions, (2) is scored in a “standard” or consistent manner, which makes it possible to compare the relative performance of individual students or groups of students.