Chapter 13: Deriving Standardized Scores Flashcards
(118 cards)
This is a bell-shaped, mathematically defined, smooth curve in which the mean, median and mode lie in the exact center. This tapers as it proceeds away from the center of the distribution and, theoretically, approaches the horizontal axis without ever touching it. This serves as the basis from the interpretation of an individual’s norm-reference test scores. This allows for comparisons to be made about different client’s scores on the same test, or about the same client’s scores on different tests.
The Normal Curve
This describes how the scores spread out from the middle of the distribution. i. 34.13% of all scores fall between the mean (0 SD) and one standard deviation above (or below) the mean. ii. 13.59% of all scores fall between one standard deviation and two standard deviations (whether above or below the mean). iii. 2.14% of all scores fall between two standard deviations and three standard deviations (whether above or below the mean). iv. Only 0.13% of all scores fall beyond three standard deviations (whether above or below the mean). c. Other mathematical constants of the normal curve: i. Half (50%) of the scores always fall below the mean, and half always fall above the mean. ii. About 68% (68.26%) of all scores fall between one standard deviation above the mean and one standard deviation below the mean (±1 SD). iii. About 95% (95.44%) of all scores fall between two standard deviations above the mean and two standard deviations below the mean (±2 SD). iv. About 99% (99.74%) of all scores fall between three standard deviations above the mean and three standard deviations below the mean (±3 SD). d. The normal curve has the wonderful property of equalizing all sorts of standard score scales (e.g., T scores, deviation IQs, z-scores). That is, a deviation IQ of 130, T score of 70, and z-score of +2.00 will always mean the score falls at the 98th percentile. This interchangeability is one of the valuable characteristics of the normal curve.
Standard Deviation
These are raw scores that are meaningless without some context for interpretation. As long as a distribution of scores does not violate an assumption of the normal curve (e.g., skewness), raw scores can be linearly transformed into a standardized score and interpreted with greater meaning.
Standardized Scores Based on the Normal Curve
This involves converting a (normal) raw score distribution with a given mean and standard deviation into a standardized score distribution conforming to the normal curve characteristics with a standardized mean and standard deviation. i. Making a raw data set “fit” the normal curve allows researchers and test interpreters to communicate a lot about a client’s test score and performance.
Linear Transformation
This has a mean of zero, and a standard deviation of one and is computed from the raw score distribution using the following formula: where X is the participant’s raw score M is the sample mean SD is the sample standard deviation –1.00 means 1.00 standard deviations; the minus sign indicates “below the mean”; a positive 1.00 standard deviation indicates “above the mean”
The z-score
This provides context for meaningful interpretation by declaring that the mean of a distribution of scores is 50 and the standard deviation is 10. T scores greater than 50 are above the mean, and T scores of less than 50 are below the mean. The interpretation of scores is similar to z-scores, but the index of comparison has shifted T scores: M = 50, SD = 10
A T Score
This is a Deviation IQs, which in the education and counseling fields are frequently referred to simply as “standard scores,” are commonly used to describe scores on intelligence, achievement, and perceptual skills tests. b. A deviation IQ score provides context for meaningful interpretation by declaring that the mean of a distribution of scores is 100 and the standard deviation is 15. i. Nearly all tests use a standard deviation of 15. ii. Standard scores greater than 100 are above the mean, and standard scores of less than 100 are below the mean. The formula for converting a raw score into an SS (standard score) score is or SS = 15 (z) + 100. 4.
Deviation IQ and Standard Scores (M = 100; SD = 15)
These are the Types of Standard Scores Used in Counseling, Education, and Research a. z-scores b. T scores c. Deviation IQ scores d. Normal curve equivalents (NCEs) are standard scores with a mean of 50 and a standard deviation of 21.06.
a. z-scores b. T scores c. Deviation IQ scores d. Normal curve equivalents (NCEs) are standard scores with a mean of 50 and a standard deviation of 21.06.
This is short for “standard nine,” a system that divides the normal curve into nine equidistant segments, using the formula 2z + 5 and rounding to the nearest whole number. i. Stanines 2 through 8 represent a ½ standard deviation range with the 5th stanine straddling the mean (i.e., ±¼ SD [z-scores of –0.25 to +0.24]). ii. Stanines are frequently used in large-scale achievement testing programs, but they should be interpreted with caution.
A Stanine
These have a mean of 10 and a standard deviation of 3, and are frequently used in intelligence, achievement, or perceptual skills measures to report on subtest or subscale scores. i. The scaled scores for these several subtests can be summed and converted into a standard score (M = 100; SD = 15). ii. One example is the Wechsler Scales (e.g., Wechsler Adult Intelligence Scales)
Scaled Scores
These have a mean of 500 and standard deviation of 100. i. The Scholastic Assessment Test (SAT) is an example.
CEEB (College Entrance Examination Board) Scores
This indicates the percentage of observations that fall below a given score on a measure plus one-half of the observations falling at the given score. i. Percentile ranks are easy to understand when one visualizes a line of 100 individuals with characteristics similar to the reference group under study (e.g., age, grade, sex), with the first individual standing in line possessing the least amount of the construct under study and the 100th person standing in line possessing the greatest amount of the construct under study.
Percentile Ranks
In this, if a participant scored at the 37th percentile rank (i.e., P37), this means that her performance exceeded 37 percent of all those comprising the reference group. We compute percentile ranks from a raw score distribution using the following formula: where PR is the percentile rank n is the sample size cf is the cumulative frequency f is the frequency of occurrence of the value being determined ii. Percentile ranks should not be confused with percentage scores. A percentage is the percent correct out of the total.
Percentile Ranks
This indicate an individual’s score as referenced to a comparison group of individuals with like characteristics—thus the term norm-referenced. 2. A percentage score indicates only the percentage of responses that met some criterion of correctness but not referenced or related to any group of individuals or scores—thus the term criterion-referenced
Percentile Ranks
These are not in equal intervals so they cannot be subjected to mathematical operations such as addition, subtraction, or multiplication. 1. These must be converted to standard scores, subjected to mathematical operations, and then converted back into percentiles. iv. These are relatively easy to understand and to explain to those who have little or no test sophistication. v. Reporting a quartile is a common way of dividing a percentile rank distribution into four portions. 1. Q1 covers P0–P24; Q2, P25–P49; Q3, P50–P74; and Q4, P75–P99
Percentile Ranks
This can help convey performance information to those individuals who are unskilled in test score interpretation. Interpretive ranges are verbal performance descriptors such as Average, High Average, and Very Superior which help clients understand their performance in simple terms.
An Interpretive Range
In this, Counselors can use conversion tables to convert different types of scores to make test interpretation more consistent. Conversion of these diverse types of scores (e.g., T scores, z-scores, percentile ranks, and deviation IQs) to a single scale makes comparisons simple and meaningful.
Converting One Standardized Score into Another
In this, it is sometimes of interest to determine the difference or distance between certain types of standardized scores, as well as the area between certain scores under the normal curve (such as figuring out what percent of the population may fall between two standard scores). b. To do this, standardized scores are converted into z-scores and compared using a table of values for areas under the normal curve. Using the information in the table, the two scores can be subtracted or manipulated depending upon what the researcher is trying to find. i. If the z-score is positive, the value from the table is added to .50 to indicate the area above the mean.
Finding the Distance Area Between Given Scores
Researchers frequently analyze theories or models, develop questions or hypotheses about how variables will behave as predicted by these theories or models, and design experiments to collect data that will answer the questions/hypotheses, thus confirming or rejecting the expected results. This process is based on logical, analytical questioning procedures. b. The statistics used in a study are generally inferential statistics because a researcher makes a judgment on a population parameter based on sampling data. i. If a sample of participants has been randomly or in another way faithfully obtained, the assumption is that the results represent or extend to the population of interest. c. When conducting a study for empirical purposes, three steps are required: i. Translate the research question/hypothesis into a statistical (null) hypothesis. ii. Design, conduct, and analyze the results (data) of the study using an inferential statistic. iii. In this, you determine whether to reject or retain (never accept!) the null hypothesis.
Process: Statistical Hypothesis Testing
This is the first step in statistical hypothesis testing is to convert the research question or research hypothesis into a statistical hypothesis, more commonly known as the null hypothesis and alternative hypothesis. i. Because inferential statistics are about a population, the hypotheses are created using parameter statistics.
Steps to Identifying the Null and Alternative Hypotheses
This is the hypothesis that is rejected or retained with inferential statistics and is often the opposite of what the researcher believes to be true (i.e., no difference exists).
The Null Hypothesis (HO)
This is generally the research hypothesis and is a statement of what occurs if the null hypothesis is rejected. They are typically written in statistical notation as follows:
i. In hypothesis testing, the null hypothesis is always tested.
ii. The null hypothesis (H0) indicates that there is no difference between the groups. If the null hypothesis is retained, then no statistical group differences were found.
iii. If the null hypothesis is rejected, then the alternative hypothesis (H1) holds true (i.e., statistical differences do exist between the groups).
The Alternative Hypothesis (H1)
iv. The researcher never “accepts” that the null hypothesis (H0) is true. This is because the null hypothesis is a probability-based statement, and there are at least two additional reasons the experimenter must consider to explain the result:
1. A true difference may have existed, and the sample result did not faithfully express this true population result.
2. There may have been bias in the experimental procedures that led to a conclusion of no difference when a difference did, in fact, exist.
a. Retaining or rejecting the null hypothesis is directly related to the amount of error the researcher chooses to allow in the study.
b. Whether H0 is retained or rejected, researchers reach one of two decision outcomes: The researcher either makes a correct decision or an incorrect decision.
c. There are two types of error to consider in hypothesis testing: type I error and type II error.
The Alternative Hypothesis (H1)
. Type I Error a. This type of error occurs when the null hypothesis is rejected but should have been retained.
i. The researcher determined that statistically significant differences existed between the groups when the difference actually did not exist.
ii. The amount of type I error is identified as α (alpha).
iii. The researcher has some control over alpha, although sample size also influences alpha.
iv. The researcher always establishes the amount of type I error allowed at the outset of the study.
Type I Error