Chapter 6: Test Construction Flashcards

Question

Pilot testing

Answer 1

- pilot testing with a sample from the population - interest lies in the reactions of the sample to each of the questions, and can be achieved using focus groups, where difficulties with the items—ranging from wording to cultural appropriateness—can be identified.

Answer 2

Tryout | Item Analysis

Answer 3

- Administer the test on a representative sample - The recommended sample size varies - usually 100+ - Use standardised instructions - The test ‘tryout’ data is then used to narrow down the number of items

Answer 4

* the process of studying the behaviour of items when administered to a group of respondents, usually with a view to the selection of some of the items to form a psychological test - The items will have been reviewed and edited prior to administration using local experts and a pilot (i.e. a small-scale sample) study with members of the intended population. Qualitative - larger sample would be asked to comment on the items (readability, comprehensibility, clarity and apparent strangeness. Quantitative - how the items ‘behave’ when people are asked to complete them. - depends on the measurement model but typically the focus is on item difficulty and item discrimination - reliability and validity - Dimensionality (i.e. factor analysis)  whether all the items within the scale are measuring the same underlying construct or latent variable; or whether there is more than one construct

Answer 5

the correlation between an item and score on an external criterion being used to validate the test.

Answer 6

Two techniques used for CTT measurement model o exploratory factor analysis - If only one construct has been targeted in the test then EFA should show one strong factor. o If more than one construct is being examined in the test, then more than one factor should emerge with notable factor loadings for items as specified in the construct specification and test plan. oCronbach’s alpha would be calculated and evaluated in the light of guidelines for test use Using a number of representative samples allows one to check the replicability of the findings and provides increased confidence that the decisions being made about the test are sound.

Answer 7

recommended in all scale development processes (psychological) - New scale development usually starts with exploratory factor analysis (EFA) to identify a manageable number of factors to extract - Confirmatory factor analysis (CFA) used when number of factors is known. HELPS: - Determine the number of underlying latent variables or constructs - condense information (find out what is actually the underlying factors) - Define the content or meaning of the factors - identify items that are performing better or worse - Items that do not fit into any factor, or those that fit into more than one can be considered for elimination Number of factors to extract o Eigenvalues (> 1) ; those factors that are greater than one, you usually consider retaining as a factor o Scree Plot Rotation - Helps interpret the data - Oblique: assumes factors are correlated - Orthogonal: assumes factors are uncorrelated

Answer 8

Are the items ‘homogenous?’  Correlation between the score for the test item and the scale score (item-scale correlations)  Inter-relatedness of the test items (Cronbach’s alpha) Can help to identify items to be discarded  ‘Outlier’ items  Items that are incongruous with the test

Answer 9

``` Test has been: - Conceptualised - Constructed - Tried - Analysed - Revision: o A stage in new test development o A stage in modifying an existing test ``` Those checks - Internal Consistency (IT needs to stay the same across new and old test) - Factor Analysis - Cross-Validation - Collection of additional criterion-related validity data - Does the test predict the criterion in the new sample as well as it did in the old?

Answer 10

- Is the test applicable to this population? - Realistic test manual information Validity Shrinkage: - Often lower validity the second time around - Inevitable - Generally a slight difference - Eliminating chance results - Near enough is good enough!

Answer 11

- Norming the test in a representative population – General population vs specific population (i.e. cultural group, patients diagnosed with a X disorder) will depend on the use of the test – Often stratified by age and gender oExplicit  norms are prepared in such a way that these variables are identified in the tables that are prepared. - Creating a test manual/instructions - Publication

Chapter 6: Test Construction Flashcards

(35 cards)