3rd exam Flashcards
(128 cards)
What is the formula for classical testing theory?
X= T + E (x- observed score), t (true score), E (error, systematic and random)
What creates a problem for classical testing theory?
Guessing on an achievement test could cause the true score to be wrong
Do we know when people guess?
We never know when someone is guessing
Abott’s formula
allows you to understand and calculate true score for blind guessing
If you are guessing wrong what happens within classical testing theory?
the observed score is not reflective of their true score
Abbotts actual math formula
R (correct responses) - W (wrong responses) divided by K (number of alternatives) -1
To overcome the influence of blind guessing
one should advise examinees to attempt every question– since not all guessing is blind. Guessing one can narrow down and get it correct and the number of times blind guessing goes on tends to be less frequent
What is an error in multiple choice questions?
not the question its self but the responses you chose from
What is the error within short-answer questions?
the issue is what is the question asking and how do I answer it? this affects reliability
Ebels idea of reliability and response options
reliability studies have been done on the number of response options, a better way to increase test reliability is to add more items (responses should be around 5)
Speed tests
best way to calculate reliability for speeded tests is to do a split half reliability on the test
With speed tests how should you do reliability
administer half the test and give half the time to complete the test, also administer 2 weeks apart, better indicator of reliability
Halo Effect
raters tendency to perceive an individual who is high (or low) in one areas is also high (or low) in other areas
2 kinds of halo effects
general impression model and salient dimension model
General impression model
tendency of rater to allow overall impressions of an individual influence judgment of a persons performance (ex: person may rate reporter as “impressive” and thus, also rate him/her as her speech as strong)
Salient dimension model
take one quality from the person and that affects the rating of another quality of the person (ex: people rated as attractive are also rate as more honest) (make inferences about an individual based on one salient trait or quality)
Simpson paradox
aggregating data can change the meaning of the data, can obscure the conclusions because of a third variable
Percentages are at the heart of the simpson paradox, why are they bad?
because they obscure the relationship between the numerator and denominator (ex: 8/10 is 80% but also 80/100 80% is the same but number of people who reviewed a restaurant is different)
What is important in knowing the percentage?
you need to know what the numerator and denominator are, or you are misinterpreting the percentages
What happens when you disaggregate the data?
you can truly see if the phonomenon is actually occurring in simpson paradox
Clinical Decision-Making
make decisions on own clinical experience
Mechanical decision-making
make decisions based on data or statistics
Clinical psychologists often feel that their decision making is
absolute, but it is flawed because there are biases that we pull that affect our decisions
Robin Dawes
asserts that mechanical prediction is better than clinical prediction