Stats & Test Construction Flashcards Preview

Question 1

Q

Type I error

Answer

A

Mistakenly rejecting the null hypothesis when it’s true

Alpha

Question 2

Q

Type II error

Answer

A

Mistakenly retaining the null hypothesis when it is false

Beta

Question 3

Q

Discriminant analysis

Answer

A

Technique in multivariate statistics that describes differences between 2+ groups on a set of measures or that classifies subjects into groups based on a set of measures

Question 4

Q

Threats to internal validity

Answer

A

Maturation, history, instrumentation, statistical regression, selection, attrition/mortality, interaction w/ selection

Question 5

Q

Ways to control threats to internal validity

Answer

A

Random assignment, within-subjects designs, blocking, matching subjects, ANCOVA

Question 6

Q

Threats to external validity

Answer

A

Interaction b/t testing & tx, interaction b/t selection & tx, reactivity, multiple tx interference (order/carryover effects)

Question 7

Q

Ways to control threats to external validity

Answer

A

Random sampling, naturalistic/field research, single- or double-blind designs, counterbalancing

Question 8

Q

What are some ways to increase power?

Answer

A

Increase alpha, increase N, increase effect size, decrease error, use powerful statistics, one-tailed if possible
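A rough sketch of how these levers move power. It assumes a one-sample z-test with known SD, chosen only because its power has a simple closed form; `z_test_power` is a hypothetical helper name, not something from the deck.

```python
from math import sqrt
from statistics import NormalDist

def z_test_power(effect_size, n, alpha=0.05, one_tailed=True):
    """Approximate power of a one-sample z-test with known SD.

    effect_size is the standardized mean difference (Cohen's d).
    """
    std_normal = NormalDist()
    tail_alpha = alpha if one_tailed else alpha / 2
    critical_z = std_normal.inv_cdf(1 - tail_alpha)
    # Chance the statistic lands beyond the critical value when H1 is true.
    # (The two-tailed case ignores the tiny rejection region in the wrong tail.)
    return 1 - std_normal.cdf(critical_z - effect_size * sqrt(n))

baseline = z_test_power(effect_size=0.5, n=20)
print(f"baseline (d=0.5, N=20, alpha=.05, one-tailed): {baseline:.2f}")
print(f"larger N (40):        {z_test_power(0.5, 40):.2f}")
print(f"larger alpha (.10):   {z_test_power(0.5, 20, alpha=0.10):.2f}")
print(f"larger effect (0.8):  {z_test_power(0.8, 20):.2f}")
print(f"two-tailed instead:   {z_test_power(0.5, 20, one_tailed=False):.2f}")
```

Decreasing error works through the same formula: less error variance means a smaller SD, which raises the standardized effect size.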

Question 9

Q

What percentage of scores on the normal curve fall between +/- 1 SD, +/- 2 SD, +/- 3 SD?

Answer

A

68%
95%
99.7%
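These proportions can be checked from the normal distribution itself. A minimal sketch using the error function from the standard library; `share_within` is a hypothetical helper name.

```python
from math import erf, sqrt

def share_within(k):
    """Proportion of a normal distribution within +/- k SD of the mean."""
    return erf(k / sqrt(2))

for k in (1, 2, 3):
    print(f"+/- {k} SD: {share_within(k):.1%}")
```

To one decimal place the values are 68.3%, 95.4%, and 99.7%.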

Question 10

Q

What percentiles are equivalent to the following z-scores?
-3
-2
-1
1
2
3

Answer

A

0.1 = -3
2 = -2
16 = -1
84 = 1
98 = 2
99.9 = 3
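The percentiles above are rounded values of the standard normal cumulative distribution. A short sketch that recomputes them, assuming Python 3.8+ for `statistics.NormalDist`; `z_to_percentile` is a hypothetical helper name.

```python
from statistics import NormalDist

def z_to_percentile(z):
    """Percentage of the standard normal distribution falling below z."""
    return 100 * NormalDist().cdf(z)

for z in (-3, -2, -1, 1, 2, 3):
    print(f"z = {z:+d}: percentile {z_to_percentile(z):.1f}")
```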

Question 11

Q

Factors affecting test reliability

Answer

A

Test characteristics (length, item type, item homogeneity, influence of guessing), sample characteristics (sample size, range, variability), extent of test clarity

Question 12

Q

Sources of error in internal reliability

Answer

A

Content sampling, heterogeneity of content domain

Question 13

Q

Sources of error in test-retest reliability

Answer

A

Time-sampling factors

Question 14

Q

Which type of reliability is best for speed tests?

Answer

A

Alternate forms

Question 15

Q

Sources of error in inter-rater reliability

Answer

A

Factors related to raters (motivation, biases), characteristics of measuring device, consensual observer drift

Question 16

Q

Dimensions of relevance in item analysis

Answer

A

1) Content appropriateness (item assesses bx domain the test is intended to evaluate)
2) Taxonomic level (does item reflect appropriate cognitive or ability level of population intended for)
3) Extraneous abilities (to what extent are knowledge or skills needed that are outside the domain being evaluated)

Question 17

Q

Item difficulty

Answer

A

The percentage of people who get an item correct

Question 18

Q

Item discrimination

Answer

A

Extent to which an item differentiates between examinees with high vs. low total scores

.35 or more is acceptable
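A small sketch of both item statistics. The discrimination index here is the upper-lower version (proportion correct in the top group minus the bottom group); the 27% group size is a common convention and an assumption, since the card does not say how the index is computed. The data and function names are made up for illustration.

```python
def item_difficulty(item):
    """Proportion of examinees answering the item correctly (0/1 scores)."""
    return sum(item) / len(item)

def discrimination_index(item, totals, fraction=0.27):
    """Upper-lower index: p(correct) in the top group minus the bottom group."""
    n = max(1, round(len(item) * fraction))
    # Examinee positions ordered from lowest to highest total score.
    order = sorted(range(len(item)), key=lambda i: totals[i])
    low, high = order[:n], order[-n:]
    p_high = sum(item[i] for i in high) / n
    p_low = sum(item[i] for i in low) / n
    return p_high - p_low

# Hypothetical data: one item's 0/1 scores and each examinee's total score.
item = [1, 1, 0, 1, 1, 0, 1, 0, 1, 0]
totals = [48, 45, 41, 39, 36, 30, 28, 25, 20, 15]

print(f"difficulty p = {item_difficulty(item):.2f}")
print(f"discrimination D = {discrimination_index(item, totals):.2f}")
```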

Question 19

Q

Item response theory

Answer

A

Approach in which item performance is modeled as a function of the examinee’s level on the trait being measured, rather than the total test score

Question 20

Q

Reliability coefficient

Answer

A

Proportion of variability in obtained test scores that reflects true score variability

Never squared to interpret

Question 21

Q

Standard error of measurement (SEM)

Answer

A

An index of the amount of error that can be expected in a person’s obtained scores due to the unreliability of the test
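The usual formula is SEM = SD × sqrt(1 − reliability). A minimal sketch with illustrative values (SD = 15, reliability = .91 or .75); the function name is hypothetical.

```python
from math import sqrt

def standard_error_of_measurement(sd, reliability):
    """SEM = SD * sqrt(1 - r_xx)."""
    return sd * sqrt(1 - reliability)

# Higher reliability gives a smaller SEM for the same score SD.
for r in (0.75, 0.91):
    sem = standard_error_of_measurement(15, r)
    print(f"reliability {r:.2f}: SEM = {sem:.2f}")
```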

Question 22

Q

What qualitative evidence do you look for in a task that has good content validity?

Answer

A

Coefficient of internal consistency will be large

Test will correlate highly with other tests of the same domain

Pre- and post-test evals of the program designed to increase familiarity with domain will indicate appropriate changes

Question 23

Q

Orthogonal rotation

Answer

A

Resulting factors are uncorrelated; attribute measured by one factor is independent from the attributes measured by the other factor

Question 24

Q

Oblique rotation

Answer

A

Resulting factors are correlated & attributes measured by the factors are not independent

Question 25

Q

What is the Rosenthal/Pygmalion effect?

Answer

A

Tendency for a participant’s performance to be affected by the expectations of the tester

Question 26

Q

What is the Hawthorne effect?

Answer

A

Tendency of subjects to behave differently when they are in a research study

Question 27

Q

What is the most common measure for internal test reliability?

Answer

A

Cronbach’s alpha (with dichotomous items it is equivalent to KR-20)
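A sketch of the standard alpha formula, alpha = k/(k − 1) × (1 − sum of item variances / variance of total scores). The rating data and the function name are made up for illustration.

```python
from statistics import pvariance

def cronbach_alpha(scores):
    """scores: one row per respondent, one column per item."""
    k = len(scores[0])
    item_variances = [pvariance(col) for col in zip(*scores)]
    total_variance = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical 5 respondents x 3 items on a 1-5 rating scale.
ratings = [
    [4, 5, 4],
    [2, 3, 3],
    [5, 5, 4],
    [1, 2, 2],
    [3, 3, 4],
]
alpha = cronbach_alpha(ratings)
print(f"alpha = {alpha:.3f}")
```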

Question 28

Q

What measure is used to evaluate the effect of lengthening or shortening a test?

Answer

A

Spearman-Brown correction formula
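The formula is r_new = n × r / (1 + (n − 1) × r), where n is the factor by which test length changes. A minimal sketch with an illustrative starting reliability of .70; the function name is hypothetical.

```python
def spearman_brown(reliability, length_factor):
    """Predicted reliability after changing test length by length_factor."""
    return length_factor * reliability / (1 + (length_factor - 1) * reliability)

print(f"doubled: {spearman_brown(0.70, 2):.3f}")
print(f"halved:  {spearman_brown(0.70, 0.5):.3f}")
```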

Question 29

Q

What formula is used to assess the reliability of a test with dichotomous responses?

Answer

A

Kuder-Richardson formula
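The card does not say which Kuder-Richardson formula is meant; the sketch below assumes KR-20, the one most often cited: KR-20 = k/(k − 1) × (1 − sum of p×q / variance of total scores). The response data and function name are made up.

```python
from statistics import pvariance

def kr20(responses):
    """responses: one row per examinee, one 0/1 column per item."""
    k = len(responses[0])
    n = len(responses)
    # p = proportion passing each item, q = 1 - p.
    pq_sum = 0.0
    for col in zip(*responses):
        p = sum(col) / n
        pq_sum += p * (1 - p)
    total_variance = pvariance([sum(row) for row in responses])
    return k / (k - 1) * (1 - pq_sum / total_variance)

# Hypothetical 5 examinees x 4 right/wrong items.
answers = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 0, 0, 0],
]
print(f"KR-20 = {kr20(answers):.2f}")
```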

Question 30

Q

What are acceptable scores of reliability?

Answer

A

.80 & above = good
.70-.79 = acceptable
.60-.69 = marginally reliable
.59 and below = not reliable

Question 31

Q

Name the 4 scales of measurement

Answer

A

1) Nominal = names of categories
2) Ordinal = rank data
3) Interval = no absolute 0, numbers scaled at equal distances
4) Ratio = has absolute 0

Question 32

Q

What are the assumptions of parametric statistics?

Answer

A

Normal distribution
Homogeneity of variance (variance equal among all groups)
Independence of observations

Question 33

Q

F-ratio in a one-way ANOVA

Answer

A

Ratio of between group to within group variance
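A hand computation of that ratio as mean square between over mean square within. The three groups are made-up data and `one_way_f` is a hypothetical helper name.

```python
def one_way_f(groups):
    """F = (SS_between / df_between) / (SS_within / df_within)."""
    all_scores = [x for g in groups for x in g]
    grand_mean = sum(all_scores) / len(all_scores)
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    df_between = len(groups) - 1
    df_within = len(all_scores) - len(groups)
    return (ss_between / df_between) / (ss_within / df_within)

groups = [[4, 5, 6], [6, 7, 8], [8, 9, 10]]
print(f"F = {one_way_f(groups):.1f}")
```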

Question 34

Q

Moderator variable

Answer

A

Relationship of A and C depends on the value of B (the moderator)

Question 35

Q

Mediating variable

Answer

A

Accounts for (or partially accounts for) a relationship b/t an IV and DV

Relationship between A and C decreases or is eliminated when B is included in the model

Question 36

Q

What is the null hypothesis in Chi-square?

Answer

A

Observed frequencies do not differ from the frequencies expected by chance

Alternate hypothesis is that the observed frequencies are related to the treatment effect
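The test statistic sums (observed − expected)² / expected over all cells, with expected counts taken from the row and column totals. A sketch for a contingency table with made-up counts; `chi_square` is a hypothetical helper name.

```python
def chi_square(table):
    """Pearson chi-square statistic for a table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Count expected in this cell if rows and columns are unrelated.
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    return statistic

# Hypothetical 2x2 table: rows = tx vs. control, columns = improved vs. not.
observed = [[30, 10], [20, 20]]
print(f"chi-square = {chi_square(observed):.2f}")
```

With 1 degree of freedom the critical value at alpha = .05 is 3.84, so a statistic above that leads to rejecting the null.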

Question 37

Q

Central limit theorem

Answer

A

As sample size increases, shape of sampling distribution of sample means approximates a normal distribution.

Mean of sampling distribution of sample means = mean of population.
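A small simulation of both statements. It assumes a skewed exponential population with mean 1, picked only for illustration; the function names are hypothetical and the seed is fixed so runs repeat.

```python
import random
from statistics import pstdev

def sample_means(sample_size, replications=2000, seed=0):
    """Means of repeated samples from an exponential population (mean 1)."""
    rng = random.Random(seed)
    return [
        sum(rng.expovariate(1.0) for _ in range(sample_size)) / sample_size
        for _ in range(replications)
    ]

def skewness(values):
    """Third standardized moment; 0 for a symmetric distribution."""
    m = sum(values) / len(values)
    s = pstdev(values)
    return sum(((v - m) / s) ** 3 for v in values) / len(values)

for n in (2, 10, 50):
    means = sample_means(n)
    center = sum(means) / len(means)
    print(f"n={n:2d}  mean of means={center:.2f}  "
          f"SD of means={pstdev(means):.2f}  skew={skewness(means):.2f}")
```

As n grows, the mean of the sample means stays near the population mean, while the skew of the sampling distribution shrinks toward the 0 expected of a normal curve.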

Question 38

Q

What factors affect Pearson’s product moment correlation?

Answer

A

Linearity (assumes linear relationship b/t 2 variables)

Homoscedasticity (variability of scores on one variable is similar across the range of the other)

Range of scores (wider range provides more accurate estimate)

Question 39

Q

Point-biserial coefficient

Answer

A

Correlation between one continuous variable & one dichotomous variable

Question 40

Q

Phi coefficient

Answer

A

Correlation b/t 2 dichotomous variables
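For a 2x2 table with cells a, b (top row) and c, d (bottom row), phi = (ad − bc) / sqrt((a+b)(c+d)(a+c)(b+d)). A minimal sketch with made-up counts; the function name is hypothetical.

```python
from math import sqrt

def phi_coefficient(a, b, c, d):
    """Phi for a 2x2 table laid out as [[a, b], [c, d]]."""
    return (a * d - b * c) / sqrt((a + b) * (c + d) * (a + c) * (b + d))

phi = phi_coefficient(30, 10, 20, 20)
print(f"phi = {phi:.3f}")
```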

Question 41

Q

Assumptions of regression

Answer

A

Linear relationship b/t X and Y

Homoscedasticity (error scores of criterion are the same across range of x)

Homogeneity of variance

Question 42

Q

Multicollinearity

Answer

A

Degree to which predictors correlate with each other

Decreases the accuracy of the regression equation

Question 43

Q

Sensitivity

Answer

A

TP / (TP + FN)

Question 44

Q

Specificity

Answer

A

TN / (TN + FP)

Question 45

Q

Positive likelihood ratio

Answer

A

Indicates how much more likely a positive test result is in a pt who has the condition than in one who does not (a PLR of 3 means a positive result is 3x as likely in a pt w/ the condition as in a pt w/o it)

Sensitivity / (1 - specificity)

Question 46

Q

Positive predictive power

Answer

A

Probability that a pt with a + test has the true condition

TP / (TP + FP)

Question 47

Q

Negative predictive power

Answer

A

Probability that a pt with a negative test result does not have the condition

TN / (TN + FN)
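The five classification statistics from the last few cards, computed from one 2x2 outcome table. The counts are made up and `diagnostic_stats` is a hypothetical helper name.

```python
def diagnostic_stats(tp, fn, fp, tn):
    """Classification statistics from true/false positive/negative counts."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "positive likelihood ratio": sensitivity / (1 - specificity),
        "positive predictive power": tp / (tp + fp),
        "negative predictive power": tn / (tn + fn),
    }

# Hypothetical sample: 100 pts with the condition, 200 without.
stats = diagnostic_stats(tp=80, fn=20, fp=30, tn=170)
for name, value in stats.items():
    print(f"{name}: {value:.2f}")
```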

Question 48

Q

Relationship between base rate & PPP/NPP

Answer

A

As the base rate increases, PPP will increase, whereas NPP will decrease. Converse is true as the base rate declines.
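A sketch of that relationship, holding the test fixed (sensitivity .80 and specificity .85 are illustrative values) and varying only the base rate. The function name is hypothetical.

```python
def predictive_power(base_rate, sensitivity, specificity):
    """Return (PPP, NPP) for a test applied at a given base rate."""
    true_pos = sensitivity * base_rate
    false_neg = (1 - sensitivity) * base_rate
    true_neg = specificity * (1 - base_rate)
    false_pos = (1 - specificity) * (1 - base_rate)
    ppp = true_pos / (true_pos + false_pos)
    npp = true_neg / (true_neg + false_neg)
    return ppp, npp

for base_rate in (0.01, 0.10, 0.50):
    ppp, npp = predictive_power(base_rate, sensitivity=0.80, specificity=0.85)
    print(f"base rate {base_rate:.2f}: PPP = {ppp:.3f}, NPP = {npp:.3f}")
```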

Question 49

Q

Bayes' theorem

Answer

A

Often employed in decision analysis, allowing calculation of the posterior probability of an event (the conditional probability it is assigned when the relevant evidence is taken into account)

Question 50

Q

Item characteristic curve

Answer

A

Plot the proportion of ppl who answered correctly against the total test score, performance on an external criterion, or mathematically-derived estimate of ability; provides info on relationship between examinee’s level on the trait measured by the test & the probability that he will respond correctly on that item

Question 51

Q

Which ANOVA post-hoc correction is most conservative?

A

Scheffe

Question 52

Q

Which ANOVA post-hoc correction is appropriate for pairwise comparisons?

A

Tukey

Question 53

Q

Mann-Whitney U

Answer

A

Compare two independent groups on a DV measured with rank-ordered data

Question 54

Q

Negative skew

Answer

A

Most scores are high but few extreme low scores; mean < median < mode; easy test, ceiling effects

Question 55

Q

Positive skew

Answer

A

Most scores are low but few extreme high scores; mean > median > mode; difficult test, floor effects

Question 56

Q

Variance

Answer

A

Average of the squared differences of each observation from the mean
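A minimal sketch of that definition on made-up scores. Dividing by N gives the population variance described on the card; the sample estimate divides by N − 1 instead, shown here for contrast.

```python
from statistics import pvariance, variance

scores = [2, 4, 4, 4, 5, 5, 7, 9]
mean = sum(scores) / len(scores)

# Definition from the card: average squared deviation from the mean.
by_hand = sum((x - mean) ** 2 for x in scores) / len(scores)

print(f"by hand:             {by_hand:.2f}")
print(f"population variance: {pvariance(scores):.2f}")
print(f"sample variance:     {variance(scores):.2f}")
```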

Question 57

Q

Null hypothesis in ANOVA

Answer

A

Group means were drawn from the same population (i.e., means are equal in the population)

Question 58

Q

What factors may lead to non-normal test distributions?

Answer

A

1) existence of discrete subpopulations w/i the general population w/ differing abilities
2) ceiling or floor effects
3) tx effects that change the location of means, medians & modes, affect variability & distribution shape

Question 59

Q

How is SEM related to test reliability?

Answer

A

The greater the reliability, the smaller the SEM

Question 60

Q

Reliable change index (RCI)

Answer

A

Indicator of the probability that an observed difference b/t 2 scores from the same examinee on the same test can be attributed to measurement error
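The card gives no formula; the sketch below assumes the widely used Jacobson-Truax version, RCI = (score 2 − score 1) / (sqrt(2) × SEM), where values beyond ±1.96 are unlikely (p < .05) to reflect measurement error alone. All numbers and the function name are illustrative.

```python
from math import sqrt

def reliable_change_index(score_1, score_2, sd, reliability):
    """Jacobson-Truax RCI: score difference over the SE of the difference."""
    sem = sd * sqrt(1 - reliability)
    se_difference = sqrt(2) * sem
    return (score_2 - score_1) / se_difference

rci = reliable_change_index(score_1=40, score_2=52, sd=10, reliability=0.84)
print(f"RCI = {rci:.2f}, beyond +/-1.96: {abs(rci) > 1.96}")
```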
