L3: Individual Differences & Validation Flashcards
define interpersonal skills
skills related to social sensitivity, relationship building, working with others, listening, and communication
define incremental validity
how much a new test or predictor adds value beyond what is already measured by existing predictors
- an empirical question
- i.e. it shows whether adding a new test (like an SJT) improves the ability to predict an outcome (e.g. job performance) compared to traditional tests alone (e.g. cognitive ability tests)
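a rough sketch of how you'd check this in practice (simulated, hypothetical data, not from any study): fit the criterion on the existing predictor alone, then add the new predictor and compare R²:

```python
import numpy as np

# simulated (hypothetical) scores for 200 applicants
rng = np.random.default_rng(0)
n = 200
cognitive = rng.normal(size=n)                     # existing predictor
sjt = 0.4 * cognitive + rng.normal(size=n)         # new predictor, partly overlapping
performance = 0.5 * cognitive + 0.3 * sjt + rng.normal(size=n)  # criterion

def r_squared(X, y):
    """R^2 of an OLS fit of y on X (intercept included)."""
    X = np.column_stack([np.ones(len(y)), X])
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return 1 - resid.var() / y.var()

r2_old = r_squared(cognitive[:, None], performance)
r2_new = r_squared(np.column_stack([cognitive, sjt]), performance)
print(f"cognitive only:   R^2 = {r2_old:.3f}")
print(f"cognitive + SJT:  R^2 = {r2_new:.3f}")
print(f"incremental validity (delta R^2) = {r2_new - r2_old:.3f}")
```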
why do SJTs predict long-term success?
- they assess soft skills like communication, empathy, and decision making
- these skills remain stable over time & are crucial for job performance
- this stability is important for their validity as predictors of future success
define general procedural knowledge
the knowledge somebody has acquired about effective & ineffective courses of trait-related behaviour in situations like those described in the SJT
in Lievens' study, general procedural knowledge relates to students' procedural knowledge about (in)effective behaviour in interpersonal situations (with patients) as depicted in the SJT items
what are the implications of Lievens' study?
- SJTs are valuable for student selection alongside cognitive tests
- predicting post-academic success is important, not just academic success
- the long-term validity of SJTs supports their use in professional admissions
define reliability
consistency of a test or measurement, i.e. whether the measure is:
- dependable
- stable
- consistent over time
- gives the truest picture of someone’s abilities/characteristics
how do you test reliability?
- correlation coefficient methods
- test-retest, coeff of stability
- parallel/alternate forms, coeff of equivalence
- coeff of stability & equivalence: combines different sources of error
- internal consistency (alpha)
- interrater reliability
define correlation coefficient
degree of consistency/agreement between 2 sets of independently derived scores (r)
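a minimal sketch with made-up scores (the same computation underlies test-retest, parallel-forms, and interrater coefficients):

```python
import numpy as np

# hypothetical scores for the same 6 people, derived independently twice
scores_1 = np.array([98, 105, 110, 92, 120, 101])
scores_2 = np.array([101, 103, 112, 95, 118, 99])

r = np.corrcoef(scores_1, scores_2)[0, 1]
print(f"r = {r:.2f}")  # close to 1 = high consistency/agreement
```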
define internal consistency
degree to which the items in one test are intercorrelated
- split-half: split the test into 2 equivalent halves
- Cronbach's alpha: mean of all possible split-half coefficients (most commonly used)
ex: all math problems in a test should measure math ability
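a minimal sketch of the alpha formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), on a hypothetical item-score matrix:

```python
import numpy as np

def cronbach_alpha(items):
    """items: one row per respondent, one column per test item."""
    k = items.shape[1]
    item_vars = items.var(axis=0, ddof=1)      # variance of each item
    total_var = items.sum(axis=1).var(ddof=1)  # variance of summed test scores
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# 5 hypothetical respondents answering 4 math items
items = np.array([[3, 4, 3, 4],
                  [2, 2, 3, 2],
                  [5, 5, 4, 5],
                  [1, 2, 1, 2],
                  [4, 4, 5, 4]])
print(f"alpha = {cronbach_alpha(items):.2f}")  # high = items intercorrelated
```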
define interrater reliability
agreement between raters on their rating of some dimension
ex: Olympic judges giving similar scores for a gymnast
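one simple way to quantify this (hypothetical judge scores; more elaborate indices like the ICC or Cohen's kappa also exist) is to correlate the two raters:

```python
import numpy as np

judge_a = np.array([9.1, 8.4, 9.6, 7.8, 8.9])  # hypothetical gymnast scores
judge_b = np.array([9.0, 8.6, 9.5, 8.0, 8.7])

print(f"interrater r = {np.corrcoef(judge_a, judge_b)[0, 1]:.2f}")
```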
what is parallel forms reliability? and example?
consistency across different versions of the test
ex: 2 versions of the SAT should give similar results
what is test-retest reliability & an example?
consistency over time
ex: taking an IQ test twice & getting similar scores
what is a “good” reliability?
- depends on what you want to do with the scores
- the more important the decision (e.g. life or death, like the reliability of an fMRI scan), the more precise the measure needs to be
- rule of thumb: > .7 for research in social sciences, > .9 for selection decisions
what does validity look at?
whether a measure is:
- actually measuring the construct it's supposed to measure &
- whether the decisions based on that measure are correct
define content validity
the extent to which test items cover the intended performance domain
how do you measure content validity?
- rational examination of the manner in which the performance domain is sampled by the predictor
- SMEs (subject matter experts): degree of agreement among them regarding how essential a particular item/test is (if more than 1/2 of the SMEs say an item is essential, that item has at least some content validity; CVI formula)
- difficult for more abstract constructs
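the "more than 1/2 say essential" rule matches Lawshe's content validity ratio (CVR), one common formalisation of SME agreement; a sketch with hypothetical ratings:

```python
def cvr(n_essential, n_smes):
    """Lawshe's CVR = (n_e - N/2) / (N/2); > 0 means a majority said 'essential'."""
    return (n_essential - n_smes / 2) / (n_smes / 2)

# hypothetical panel: 8 of 10 SMEs rate an item "essential"
print(f"CVR = {cvr(8, 10):.2f}")  # 0.60
```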
define criterion validity
- How well a test predicts real-world outcomes.
- empirical relationship between predictor & criterion (performance measure)
- subtypes: predictive & concurrent
what is predictive criterion validity vs concurrent criterion validity?
predictive: measures how well a test predicts future success
(test scores collected before the criterion like college entrance exam)
concurrent: measures how well a test correlates with current performance (test scores & criterion data collected at the same time, like a job skills test)
how do you measure predictive validity?
- use stats to demonstrate the actual relationship between predictors & criteria
- for ex: linear relationship Y = a + bX + e (criterion = intercept + slope × predictor + error; sketched in code after this list)
1. measure candidates on predictor during selection (like conscientiousness)
2. select candidates without using the results (need to validate first)
3. obtain measurement of criterion performance later -> time period depends on type of job & how much training is needed (approx. 6 months)
4. assess the strength of the relationship (stats)
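a minimal sketch of step 4 (hypothetical scores: conscientiousness measured at selection, performance rated ~6 months later):

```python
import numpy as np

conscientiousness = np.array([2.1, 3.4, 4.0, 2.8, 3.9, 4.5, 3.1, 2.5])  # predictor X
performance = np.array([2.5, 3.1, 4.2, 3.0, 3.8, 4.4, 3.3, 2.4])        # criterion Y

b, a = np.polyfit(conscientiousness, performance, deg=1)  # slope b, intercept a
r = np.corrcoef(conscientiousness, performance)[0, 1]     # validity coefficient
print(f"Y = {a:.2f} + {b:.2f}X, predictive validity r = {r:.2f}")
```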
how do you measure concurrent validity?
both predictor & criterion data are gathered from the same employees (incumbents)
- cross-sectional study
what are some issues affecting validity?
- range restriction
define meta analysis
combines multiple studies to determine overall validity
what are the pros & cons of meta analysis
Pros: Resolves inconsistencies, finds general patterns.
Cons: Affected by publication bias (only strong results get published), study differences, and data quality.
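a bare-bones sketch (hypothetical study values): in the simplest, Hunter-Schmidt-style case, the overall validity is just the sample-size-weighted mean of the study correlations:

```python
import numpy as np

r_values = np.array([0.25, 0.32, 0.18, 0.40])  # validity coefficient per study
n_values = np.array([120, 300, 85, 60])        # sample size per study

r_bar = np.average(r_values, weights=n_values)  # big studies count more
print(f"weighted mean validity r = {r_bar:.3f}")
```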
what is range restriction?
if only a specific group is tested, the validity appears lower than it really is
ex: if a study only includes top students, the entrance test might seem weak at predicting performance
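a standard correction for direct range restriction is Thorndike's Case II formula; a sketch with hypothetical numbers:

```python
import math

def correct_range_restriction(r, sd_restricted, sd_unrestricted):
    """Thorndike Case II: r_c = r*u / sqrt(1 + r^2 * (u^2 - 1)), u = SD_u / SD_r."""
    u = sd_unrestricted / sd_restricted
    return (r * u) / math.sqrt(1 + r**2 * (u**2 - 1))

# observed r = .20 among top students; applicant-pool SD is twice the sample SD
print(f"corrected r = {correct_range_restriction(0.20, 1.0, 2.0):.2f}")  # ~0.38
```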