Study Cards Flashcards

(189 cards)

1
Q

What is the APGAR test? What does it measure?

A

Evaluates the health of a newborn based on appearance, pulse, grimace, activity, and respiration

2
Q

Who is Alfred Binet? Why is he important?

A

French psychologist. Introduced the idea of intelligence testing

3
Q

What is an operational definition?

A

The exact way a construct is measured, and what qualifies something as being in/out of a given category

4
Q

What is an operational measure?

A

The exact way in which something is tested, and how it should always be tested (think procedure)

5
Q

What is a normative group?

A

Aka reference group. The sample of the population used to attain a base/average score

6
Q

What is a normal distribution? What is it used for?

A

A distribution that, when mapped out, forms a bell curve in which the mean, median, and mode are equal.
Used as the default assumption about how scores in a group are laid out

7
Q

What are deviations?

A

The difference between the observed values and the mean

8
Q

What was the first version of the Binet-Simon intelligence test? What did results show?

A

A group of children were asked to perform a series of tasks to assess the knowledge they had acquired

9
Q

What were Binet’s original concerns with his intelligence test?

A

That it would be misused, and that children who were behind would be labeled “idiots” and unteachable.

10
Q

What are some of Binet’s contributions to the natural and social sciences?

A
  1. The development of scales of measurement
  2. The formal operationalization of constructs
  3. The development of non-verbal intelligence tests
  4. The proposal that intelligence is both acquired and innate
  5. The operationalization of terms and concepts
  6. The development of mental age
  7. The idea and use of normative groups
  8. Established the dominance of psychology in the field of testing
11
Q

Who is Francis Galton? What did he contribute to psychology?

A

He was a psychologist with a fascination for data collection and variability. He pioneered large-scale data collection

12
Q

What is the law of error? Is it 100% true?

A

In any group or set of measurements, the outliers tend to cancel each other out, forming a normal distribution. It is not always true, but it is used as a working assumption

13
Q

What are distributions of error (deviations) and how do you calculate them?

A

A deviation shows how far, on a scale from -3 to +3, scores are away from the mean.
Observed score - mean = deviation
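
Not from the deck: a minimal Python sketch of this calculation, using invented scores.

```python
# Deviation = observed score - mean, on made-up scores.
scores = [72, 85, 90, 65, 88]
mean = sum(scores) / len(scores)          # 80.0
deviations = [x - mean for x in scores]   # [-8.0, 5.0, 10.0, -15.0, 8.0]
# Deviations always sum to (essentially) zero around the mean.
print(mean, deviations, sum(deviations))
```
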

14
Q

What are the first 3 principles of psychometrics?

A
  1. Defining and operationalizing is central to understanding if a claim is justifiable - always ask how a construct is measured and defined
  2. Variability exists everywhere - this is the essence of the law of error
  3. There is always a normative group - ask who is the sample and who created the sample
15
Q

How does Anne Anastasi define a psychological test? Define the different aspects

A

An objective and standardized measure of a sample of behaviour
Objective: free of bias, clearly defined, little to no interpretation
Standardized: everyone gets the same test and is measured the same way
Sample of behaviour: This should be how they would act regularly, but the sample may not be representative

16
Q

How does Lee Cronbach define psychological tests? How does it compare to the aspects of Anastasi’s definition?

A

Psychological tests are a systematic procedure for comparing the behaviour of two or more people.
Systematic vs standardized and objective: Cronbach recognized that tests cannot be 100% objective

17
Q

What is psychometrics according to Thurstone? (2 parts)

A

A construction of instruments and procedures for measurement
The development and refinement of theoretical approaches to measurement

18
Q

What is a construct? And how do they relate to the definition of psychometrics?

A

A construct is any idea or concept we’d like to measure
A. Constructing tests to measure these constructs
B. The methods and approaches must be refined when measuring these constructs

19
Q

What are the 4th and 5th principles of psychometrics?

A
  4. Most (if not all) test questions, in any format, are imperfect indicators of the construct being measured
  5. Assigning numbers to data imposes a relationship among indicators that may not be justifiable
20
Q

What does it mean to measure something? What are the 4 main scales of measurement?

A

The assigning of numbers to individual scores in a systematic way, according to one or another rule or convention
1. Ratio
2. Interval
3. Nominal
4. Ordinal

21
Q

Explain the 4 main scales of measurement

A

Ratio: Equal intervals with a true zero
Interval: Equal intervals with NO true zero
Nominal: a categorical form of organizing data
Ordinal: Determined rank or order, numbers have no value, intervals may be unequal

22
Q

What is the 5th principle of psychometrics?

A

The leap of faith principle. By assigning numbers to data, you impose a relationship among indicators that might not be justifiable

23
Q

What does a distribution measure in psychometrics?

A

The performance of the entire test

24
Q

What are the 3 factors that ALWAYS affect variability?

A

Systematic effect, systematic bias, random effect

25
What is systematic effect?
It is the primary cause of the score. How much of the construct you have
26
What is systematic bias? Give an example
An effect that affects a subgroup. EX: a delayed train affects commuters
27
What is random effect? Give an example
Random factors that affect the score of an individual, but have no relationship to the construct. EX: poor sleep
28
What is the difference between a formal and an operational definition?
A formal definition defines the construct for what it is while an operational definition defines how it is measured
29
How does Plato’s allegory of the cave help us understand constructs?
It captures the challenges we face when measuring constructs that cannot be directly seen. Like the shadows, only symptoms are observed, and interpretations must be made
30
What is Novak’s classical test theory?
A person’s true score is different from their observed score (due to error)
31
How do you calculate true score (Novak)?
X = T + E, so T = X - E; because error can be positive or negative, this is often written T = X ± E
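
Not from the deck: a small Python simulation of the idea, with a made-up true score and error spread, showing why averaging repeated observations approaches T.

```python
import random

# Classical test theory sketch: observed = true + random error (X = T + E).
# The true score and error spread are invented illustration values.
random.seed(42)
T = 100  # hypothetical true score
observed = [T + random.gauss(0, 5) for _ in range(10_000)]

# Because error is random (equally likely + or -), the average of
# repeated observed scores converges on the true score.
print(sum(observed) / len(observed))  # ~100
```
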
32
How does Galton’s law of error play in classical test theory?
Error is just as likely to be positive as negative
33
What is item response theory?
An attempt to directly estimate an individual’s ‘true score’ by examining how individuals respond to questions - as a function of their ability
34
What does an item response graph show?
Shows the minimum required ability to get an answer correct
35
What is scientific model testing?
The evaluation of different approaches to find which one best explains the data in that case
36
How does Ockham’s razor fit with scientific model testing?
When two theories explain the data equally well, the simpler explanation is usually the better one
37
What is the definition of criterion validity?
Correlating test scores with some external criterion that is relevant to the purpose of the test
38
What is scale validation?
The methods used to test validity
39
What are the features of scale validation according to Rulon?
  1. A test cannot be labeled as valid or invalid without respect to a given purpose
  2. Assessments of validity must include an assessment of the content of the instrument and its relation to the purpose
  3. Different forms of validity evidence are required for different types of instruments
  4. Some measures are obviously valid (face validity) and require no further study
40
What are the 4 main domains of validity?
  1. Content validity
  2. Structural validity
  3. External validity
  4. Item validity
41
What are the 3 types of content validity?
  1. Domain representativeness
  2. Domain relevance
  3. Face validity
42
What is content validity?
The content represented by the construct; the degree to which a test measures all aspects of a criterion
43
What is domain representativeness?
The extent to which the questions/tasks/etc. measure the entire domain
44
What is domain relevance?
The extent to which the questions are relevant to assessing the construct
45
What is inclusionary criteria?
The signs and symptoms that MUST be present to have the construct
46
What is exclusionary criteria?
The signs and symptoms that CANNOT be present to have the construct
47
What type of validity includes inclusionary and exclusionary criteria? What is the interaction?
Domain relevance; these criteria are considered more important or more relevant
48
What is face validity?
Whether the test APPEARS to measure a given construct
49
What is structural validity?
The components that a test measures
50
What are the 2 components of structural validity?
1. Dimensionality 2. Order
51
What is dimensionality?
The number of factors the questions can be attributed to (pieces of the cake)
52
What is order?
The number of tiers that are needed to explain how the different factors are interrelated (layers of the cake)
53
What are the 4 factors of external validity?
  1. Criterion validity
  2. Convergent and divergent validity
  3. Predictive validity
  4. Incremental validity
54
What is external validity?
The manner in which test scores are related to other constructs
55
What is criterion validity?
The extent to which test scores on questionnaire are related to some other outcome or condition
56
What is convergent validity?
The degree to which a measure is correlated with other measures
57
What is divergent validity?
The degree to which a measure does not correlate with other measures
58
Explain the relationship chart of convergent and divergent validity?
Should converge: r > 0.70 = good convergent validity, r < 0.30 = poor convergent validity
Should diverge: r > 0.70 = poor divergent validity, r < 0.30 = good divergent validity
Anything in between is mild, and depends on the theory.
59
What does a multi-trait multi-method matrix show?
It shows the correlates of different traits and how well they converge to measure the same construct
60
How do you read the multi-trait multi-method matrix table?
The traits are listed down the side and along the top, grouped by test (method), and shows the correlation coefficient in the cross section of each individual trait
61
What are the factors of predictive validity? Define them
Concurrent (predicts a criterion measured at the same time) and prospective (predicts a criterion observed in the future) validity
62
What is incremental validity?
The degree to which a new (additional) measure adds to the prediction of a criterion - beyond what can be predicted by some other measure
63
What are closed format tests?
Tests that have preset answers that cannot be changed or elaborated
64
What does it mean to have a dichotomous response?
The answer can only be yes or no
65
What is a Likert scale response style?
A range of replies (typically from strongly agree to strongly disagree) in which a person rates how much they agree with a statement
66
What does it mean if a test response is rank-ordered?
The subject must rank each statement (example: most important - least)
67
What are open format tests?
The questions do not have predetermined responses, allowing for elaboration
68
What are open ended questions?
Questions that allow the participants to come up with their own responses
69
What is a visual-analogue response style?
When the respondents rate their level of a construct on a continuous scale
70
What are anchors? - give an example
They are statements that help specify what each number refers to in the real world. EX: 1 = rarely or never
71
What is standard deviation?
The variability within a group - differences in individual scores
72
What is standard error?
Variability across distributions - differences between groups
73
What is estimated true score?
How ability and probability of correctness correlate
74
What is the mean - and the equation for it?
Mean: the average
Mean = the sum of the scores / the number of scores
μ = Σx / N
75
What is the equation for standard deviation?
Stand. Dev = the square root of [the sum of (score - mean) squared / the number of scores]
σ = √( Σ(x - μ)² / N )
76
What is variance - and the equation for it?
The spread of differences in scores
Variance = the sum of (score - mean) squared / the total number of scores
σ² = Σ(x - μ)² / N (variance is the standard deviation squared)
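
Not from the deck: the three population formulas above in a short Python sketch, with invented scores.

```python
import math

# mean = Σx / N, variance = Σ(x - μ)² / N, SD = √variance.
scores = [4, 8, 6, 5, 7]
N = len(scores)
mu = sum(scores) / N
variance = sum((x - mu) ** 2 for x in scores) / N
sd = math.sqrt(variance)
print(mu, variance, sd)  # 6.0 2.0 1.414...
```
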
77
What is the line of best fit?
A line through a scatter plot that minimizes the discrepancy between observed and predicted scores.
Measures the degree of mis-fit between scores
78
What is a predicted score? How do you calculate it?
An estimated score for future tests
Predicted score = y-intercept + slope × X
Ŷ = b0 + b1·X (equivalently Y = aX + b)
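
Not from the deck: a hand-rolled least-squares sketch in Python on invented x/y scores; the predict helper is hypothetical.

```python
# Least-squares slope and intercept, then a predicted score Ŷ = b0 + b1·X.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 8.0, 9.8]

n = len(x)
mx, my = sum(x) / n, sum(y) / n
b1 = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
b0 = my - b1 * mx

def predict(new_x):
    return b0 + b1 * new_x

print(round(predict(6), 2))  # predicted score for a new X of 6
```
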
79
What is effect size? How do you calculate it?
The magnitude of the difference between groups
Effect size = (mean of group 1 - mean of group 2) / standard deviation
D = (x̄1 - x̄2) / s
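
Not from the deck: a Python sketch of the formula on invented groups. The card only says "standard deviation"; the pooled SD used here is one common choice, not necessarily the course's.

```python
import math
import statistics

# Cohen's d sketch: d = (x̄1 - x̄2) / s, with a pooled SD for s.
g1 = [10, 12, 11, 13, 14]
g2 = [8, 9, 10, 9, 9]
n1, n2 = len(g1), len(g2)
s1, s2 = statistics.stdev(g1), statistics.stdev(g2)
s_pooled = math.sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))
d = (statistics.mean(g1) - statistics.mean(g2)) / s_pooled
print(d)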
80
What is sensitivity? How do you calculate it?
Of the people who actually have the condition, how many were designated to have it
Sensitivity = A / (A + C)
81
What is specificity? How do you calculate it?
Of the people who don’t actually have the condition, how many were designated not to have it
Specificity = D / (B + D)
82
What is positive predictive value? How do you calculate it?
Of the positive results, how many actually have the condition
PPV = A / (A + B)
83
What is negative predictive value? How do you calculate it?
Of the negative results, how many really don’t have the condition
NPV = D / (C + D)
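
Not from the deck: the four formulas above computed from one invented 2×2 table in Python.

```python
# 2x2 classification table, using the cards' cell labels:
#                 condition present   condition absent
# test positive          A                  B
# test negative          C                  D
A, B, C, D = 40, 10, 5, 45  # invented counts

sensitivity = A / (A + C)  # of those with the condition, proportion flagged
specificity = D / (B + D)  # of those without it, proportion cleared
ppv = A / (A + B)          # of positive results, proportion truly positive
npv = D / (C + D)          # of negative results, proportion truly negative
print(sensitivity, specificity, ppv, npv)
```
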
84
What is base rate?
The baseline rate of prevalence of a condition in a population
85
What is a self report test?
A test completed by someone who reports their own experiences
86
What kind of test is the BDI? Key features
The Beck Depression Inventory is a self-report test that measures depression
A unidimensional test; the use of cutoff scores indicates a discrete condition; any combination of items can be used to designate the presence of depression
87
What is an informant based test?
A test completed on behalf of someone else
88
What are projective tests?
Tests that measure SUBCONSCIOUS impulses, emotions, difficulties, etc
89
What are objective tests? Why were they created?
Tests that use standardized measures that allow little to no interpretation
Created to account for the limits of projective tests
90
What is the RORS?
A projective test in which the patient interprets inkblots
91
What is an aptitude test?
A test designed to measure individual aptitudes, attitudes, preferences, etc
92
What is the MBTI? Key features?
The Myers-Briggs Type Indicator is a self-report measure of psychological preferences in how people see the world and make decisions
Measures innate aptitudes that are either mental or physical
93
What are structured tests?
Tests in which the questions and structure are predetermined, no changes or follow up can be made
94
What are semi structured tests?
Tests in which the procedure and questions are predetermined, but the doctor is able to add or remove questions at their discretion
95
What is the SCID? Key features?
The Structured Clinical Interview for DSM is a semi-structured test that helps clinicians assess the presence or absence of psychiatric symptoms to render formal diagnoses
It is semi-structured, allowing for follow-up and the adding/removing of questions
96
What is information variance?
The way in which questions are asked and how tests are presented changes the amount of information that comes out of a test
97
What is criterion variance?
Differences in how a doctor interprets the information to draw conclusions, which can result in changes between scores
98
What are personality tests?
Tests designed to assess personality characteristics
99
What is the NEO PI-R? Key features?
A test that measures the degree of OCEAN - openness, conscientiousness, extraversion, agreeableness, neuroticism
Uses a Likert scale for questions; multidimensional - assesses each personality characteristic based on multiple smaller factors
100
What is OCEAN in the NEO PI-R?
Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism
101
What is the MMPI? Key features?
Minnesota Multiphasic Personality Inventory. Designed to address concerns with existing self-report measures; it assesses psychopathology and personality in a clinical setting, prioritizing criterion validity over face validity
102
What is the act frequency approach?
A measure of how behaviour and personality traits correlate
103
What is the BAI? Key features?
The Behavioural Acts Inventory. Designed to measure actions and behaviours to identify their correlations with personality
104
What are normative tests?
Tests designed to measure quantitative personality characteristics, comparing them to patterns of normality
105
What are the WAIS and WISC?
Intelligence tests for adults (WAIS) and children (WISC) which evaluate intelligence and cognitive ability
106
What are achievement tests?
Tests that measure developed skills or knowledge
107
What is the GRE? Key features?
The Graduate Record Examination measures the acquired knowledge of students
Evaluates verbal reasoning, quantitative reasoning, analytical writing, critical thinking, and knowledge
108
What makes a test reliable?
When it produces the same score consistently over time
109
What does reliability measure?
How close our observed score approaches the true score
110
What is the expected score?
An estimate of true score
111
How do you calculate true score?
E(X) = T: the expected value of the observed score is the estimate of the true score
112
What is the ‘fast move’ of classical test theory?
If error is uncorrelated with test scores, then error from two different tests is also uncorrelated, meaning errors from one test will be uncorrelated with the True Score of another test
113
What are the 5 types of reliability?
Test-retest, Inter-rater, Parallel forms, Split half, Internal consistency
114
What is test-retest reliability?
The ability for a test to produce consistent scores from one time to another
115
What is inter-observer reliability?
The degree to which different observers give consistent estimates of the same construct
116
What is parallel forms reliability?
The consistency of two separate but similar tests
117
What is split half reliability?
The consistency between two halves of the same test
118
What is internal consistency (reliability)?
The consistency of the results across items of a test
119
How do you estimate reliability?
By comparing two different groups of items
120
What are the ways you can estimate reliability? Explain
Within a single test - one part vs. another part
Across multiple tests - test 1 vs. test 2
121
What is used to measure internal consistency?
Cronbach’s alpha (a) and Cohen’s Kappa (k)
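
Not from the deck: the cards don't give alpha's formula, so this Python sketch uses the standard α = k/(k-1)·(1 - Σ item variances / total variance) on invented item scores.

```python
# Cronbach's alpha on invented item scores (rows = people, cols = items).
data = [
    [3, 4, 3, 5],
    [2, 2, 3, 2],
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 4, 5, 5],
]
k = len(data[0])

def var(xs):
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / (len(xs) - 1)

item_vars = [var([row[i] for row in data]) for i in range(k)]
total_var = var([sum(row) for row in data])
alpha = k / (k - 1) * (1 - sum(item_vars) / total_var)
print(round(alpha, 2))
```
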
122
How do you calculate Cohen’s kappa (k)?
(Observed agreement - chance agreement)/(1-chance agreement)
123
How do you calculate chance agreement?
[probability of ‘yes’ from Dr. A × probability of ‘yes’ from Dr. B] + [probability of ‘no’ from Dr. A × probability of ‘no’ from Dr. B]
124
How do you calculate observed agreement?
(‘Yes’ from both + ‘No’ from both) / N
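
Not from the deck: the kappa, chance-agreement, and observed-agreement formulas from the three cards above, on invented ratings from two raters.

```python
# Cohen's kappa from two raters' yes/no calls (invented ratings).
a = ["yes", "yes", "no", "no", "yes", "no", "yes", "yes", "no", "yes"]
b = ["yes", "no",  "no", "no", "yes", "yes", "yes", "yes", "no", "yes"]
n = len(a)

observed = sum(x == y for x, y in zip(a, b)) / n
# Chance agreement: P(both say yes) + P(both say no).
both_yes = (a.count("yes") / n) * (b.count("yes") / n)
both_no = (a.count("no") / n) * (b.count("no") / n)
chance = both_yes + both_no
kappa = (observed - chance) / (1 - chance)
print(round(kappa, 2))  # ~0.58 for these ratings
```
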
125
What is item analysis?
The analysis of how each individual item on a test performs
126
What assumptions are made when calculating true score?
That T = the average score on a test if taken repeatedly, that error is random and independent
127
What score would be ‘excellent’ for reliability?
a > 0.9
128
What score would be ‘good’ for reliability?
0.9 > a > 0.8
129
What score would be ‘acceptable’ for reliability?
0.8 > a > 0.7
130
What score would be ‘questionable’ for reliability?
0.7 > a > 0.6
131
What score would be ‘poor’ for reliability?
0.6 > a > 0.5
132
What score would be ‘unacceptable’ for reliability?
0.5 > a
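
Not from the deck: the six cutoffs above as a small Python lookup; how exact boundary values are binned is my choice.

```python
# The deck's alpha cutoffs as a qualitative label.
def alpha_label(a: float) -> str:
    if a > 0.9: return "excellent"
    if a > 0.8: return "good"
    if a > 0.7: return "acceptable"
    if a > 0.6: return "questionable"
    if a > 0.5: return "poor"
    return "unacceptable"

print(alpha_label(0.84))  # good
```
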
133
What is item analysis?
The analysis of how each individual item performs and the correlation of individual items with the total score
134
What is item analysis used for?
To determine which items are the best measurement of a construct
135
What is item total correlation?
The correlation of an individual item with the total score - the cumulative degree of agreement for the construct
136
How do we calculate item total correlation?
Each item’s scores are averaged (across the group) for an item average. Item scores are summed to give each person’s total score. Each item is then plotted against the total score to find r(total, item)
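
Not from the deck: a Python sketch correlating each item with the total score, on invented data; corrected item-total correlations (dropping the item from its own total) are a common refinement not shown here.

```python
import statistics

# Item-total correlation: Pearson r between one item's scores and totals.
data = [
    [3, 4, 3, 5],
    [2, 2, 3, 2],
    [4, 5, 4, 4],
    [3, 3, 2, 3],
    [5, 4, 5, 5],
]
totals = [sum(row) for row in data]

def pearson(x, y):
    mx, my = statistics.mean(x), statistics.mean(y)
    num = sum((a - mx) * (b - my) for a, b in zip(x, y))
    den = (sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y)) ** 0.5
    return num / den

for i in range(len(data[0])):
    item = [row[i] for row in data]
    print(f"item {i + 1}: r = {pearson(item, totals):.2f}")
```
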
137
What is distinctiveness in item analysis?
When items are more highly correlated with one factor than with the others
138
What is the item response model?
The probability of choosing an option correlated with the level of a construct required to choose a given option
139
How can the item response model be explained?
The amount of knowledge you need to get an answer right.
140
What are the features of item response curves?
Discriminability, difficulty, precision
141
What is discriminability in item response?
The slope. The point at which changes are easily observed
142
When is discriminability better and worse?
Better: steep slopes
Worse: flattened regions
143
What is difficulty in item response?
How much of the construct is needed before you choose that option (answer the question correctly)
144
How do you observe the difficulty?
Using the 0.5 threshold. The point on the x-axis at which the curve is at 0.5
145
What is more difficult and less difficult in item response?
More: when the slope is very shallow for a while, or it begins further down the x-axis
Less: when the slope begins early on the x-axis and/or is very steep right away
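
Not from the deck: a standard two-parameter logistic item response curve in Python (a = discriminability, b = difficulty), illustrating the 0.5 threshold; this specific functional form is an assumption, not the course's stated model.

```python
import math

# 2PL item response curve: P(correct) = 1 / (1 + e^(-a(θ - b))).
def p_correct(theta: float, a: float, b: float) -> float:
    return 1 / (1 + math.exp(-a * (theta - b)))

# At θ = b the probability is exactly 0.5 — the difficulty threshold.
print(p_correct(theta=1.0, a=1.5, b=1.0))  # 0.5
# A steeper slope (bigger a) means better discriminability around b.
print(p_correct(0.5, a=3.0, b=1.0), p_correct(1.5, a=3.0, b=1.0))
```
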
146
What is precision in item response?
An estimate of your level of ability
147
How do you determine precision in item response?
Using the area under the curve. The space between -2 to 2 (95%)
148
What does the 95% precision tell us?
Based on the option picked, we can infer with 95% certainty that their severity level falls within the 95% of the area under the curve
149
In what ways can you describe a curve in item analysis?
Is it flat? Sharp? Where is the peak (most common area)? Does one curve override another? Is a curve high for too long?
150
What is principal components analysis (PCA)?
The examination of the degree to which individual items are related to one or more underlying dimensions of variation (factors)
151
What are the goals of PCA?
Variable reduction
Structural analysis
152
Why do we use PCA?
To reduce the redundancy in tests and see if the same construct can be better explained by a short form test
153
What is a factor pattern matrix?
A visual representation of the relation of items to the factor(s) on a test
154
What is the example of a factor pattern matrix that we have seen in class?
The red and blue squares of the NEO PI-R
155
Using a factor pattern matrix, how do we know if the items are good indicators of the factor(s)?
Strong blue squares
Using eigenvalues
156
What are Eigenvalues?
Numbers that show the proportion of variance that each factor contributes
157
What is a good eigenvalue?
Any above 1
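
Not from the deck: eigenvalues of an invented correlation matrix in Python, applying the deck's keep-above-1 rule (often called the Kaiser criterion).

```python
import numpy as np

# Eigenvalues of a correlation matrix, as in PCA. Each eigenvalue is the
# proportion of variance that factor contributes; keep those above 1.
R = np.array([
    [1.0, 0.6, 0.5, 0.1],
    [0.6, 1.0, 0.5, 0.1],
    [0.5, 0.5, 1.0, 0.1],
    [0.1, 0.1, 0.1, 1.0],
])
eigenvalues = np.linalg.eigvalsh(R)[::-1]  # largest first
print(eigenvalues)
print("retain:", int((eigenvalues > 1).sum()), "factor(s)")
```
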
158
When creating a short form, how do we know what eigenvalues to get rid of?
Those under 1, or where the curve goes flat; those with the smallest correlations; items that correlate with multiple factors
159
How do we observe incremental validity?
By comparing two measures - an existing one and a new one - to a gold standard
160
How can incremental validity be represented?
Graphically, through models
161
What are the sections of a graphical representation of incremental validity?
Just the gold standard, measure 1, or measure 2
The single overlaps: GS-M1, GS-M2, M1-M2
The total overlap
162
What is model testing in terms of incremental validity?
The ability to create a predicted score on the gold standard, based on observations on the other two+ measures
163
Based on model testing, how do we know if a test has incremental validity?
If adding this scale to the calculation of predicted score on the GS closes the gap between the predicted and observed score, there is incremental validity
164
What is the theory about adding measures when model testing for incremental validity?
The more tests you add, the closer you SHOULD be to the observed score on the GS
165
How do you calculate predicted score when model testing for incremental validity?
SSE = Σ(y - ŷ)², summed over all N observations
Sum of squared errors = the sum of (observed - predicted) scores, squared
166
What are the two prediction models when model testing for incremental validity? Explain
  1. Benchmark - the existing test vs. the GS
  2. (Existing test + new test) vs. the GS - does adding your test contribute anything?
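
Not from the deck: a Python sketch of the two models on simulated scores, comparing SSE for the benchmark (M1 only) against M1 + M2; all data and the sse helper are invented.

```python
import numpy as np

# Incremental-validity model test: does adding a new measure (M2) shrink
# SSE against the gold standard beyond the benchmark that uses only M1?
rng = np.random.default_rng(0)
m1 = rng.normal(size=100)
m2 = rng.normal(size=100)
gs = 0.6 * m1 + 0.3 * m2 + rng.normal(scale=0.5, size=100)

def sse(X, y):
    X = np.column_stack([np.ones(len(y)), X])  # add intercept column
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    return float(np.sum((y - X @ beta) ** 2))

print("benchmark (M1 only):", sse(m1, gs))
print("M1 + M2:", sse(np.column_stack([m1, m2]), gs))  # should be smaller
```
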
167
How can model testing be represented by the line of best fit?
Data points are the observed scores on the GS
Each measure has its line of best fit
The space between a point and the line shows the discrepancy between observed and predicted scores
168
How can you make your measure look better in model testing? Why?
Compare it to a poor benchmark
If the benchmark does a poor job when compared to the GS, it will make your scale look better
169
What are the outcomes of model testing?
  1. Both measures have incremental utility, one is not better than the other - retain both
  2. One measure has more incremental utility than the other - keep the better measure
  3. The measures do not contribute uniquely - choose one
  4. The measures have completely unique proportions of variation - retain both
170
How would you write a comparison of two tests? USE- GS:HRSD, M1: CESD, M2:BDI
The CESD accounts for variance in the HRSD above and beyond the variance accounted for by the BDI
171
What is confirmatory factor analysis?
Examining the structure of questionnaires and deciding which model best fits the data
172
What is used for confirmatory factor analysis?
Structural equation models
173
What is a structural equation model?
The imposition of a model on the data to evaluate fit
174
What is a latent variable?
The factors of a construct that cannot be directly observed, they are inferred using related questions
175
What are the observed indicators?
The questions
176
What are factor loadings in structural equation models?
Values that show how the latent variables relate to each other, and how the questions relate to the variables
177
In a visual SEM, what are the different parts?
Latent variables - circles - factors
Factor loadings - top r scores - correlations
Error - bottom r scores
178
What is the saturated model of SEM?
An explanatory model in which EVERYTHING is related
The benchmark
179
What is the independence model in SEM?
A model in which none of the variables are correlated
180
For the saturated and independence models of SEM, r=what?
Saturated: r = 1
Null (independence): r = 0
Other: 1 > r > 0
181
How does dimensionality factor into SEM?
Models can be uni-factorial or multi-factorial
182
What is a uni-factorial model in SEM?
Only one latent variable (circle)
183
What is a multi-factorial model in SEM?
Multiple latent variables (circles)
184
What is a nested model in SEM?
A model within another
185
How do you calculate the fit of a model in SEM?
By comparing the discrepancy between predicted and observed values to find which pattern of correlations is actually close to what has been observed
186
What are the 3 big psychometric wrongdoings?
  1. Creating a test that does not account for the behaviours of the target population
  2. Not having enough items
  3. Not using a test as it was intended
187
What is an example of no accounting for the behaviour of the target population?
TeenScreen - used to screen teens for risk of suicide, but the at-risk ones typically don’t show up
188
Why are 1 item tests not a good measure?
Responses might be wrong, and there is nothing else to verify them against
189
What is an example of not using a test how it was intended?
Using the WISC to identify children that are gifted