Lecture 2: Missing Data (Alt 2) Flashcards

(35 cards)

1
Q

What can happen if missing data is not properly addressed in psychological research?

A

It can seriously distort study results.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

What example is used to demonstrate the impact of different patterns of missingness on results?

A

Reported lifetime sexual partners, demonstrating how selective nonresponse due to stigma can drastically alter means and create misleading conclusions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

How does the lecturer distinguish missing data from selection bias?

A

Missing data refers to cases where participants omit specific responses, whereas selection bias refers to systematic issues in who participates in the study.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

What can selection bias lead to, according to the lecture?

A

Spurious correlations, such as those seen in examples involving acting talent and physical attractiveness, American college athletes, and health outcomes among smokers.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

What mechanism explains how selection bias creates false correlations?

A

Joint selection on two variables.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

What are the common sources of missing data discussed in the lecture?

A

Entire questionnaires being left incomplete, missing values on specific variables, and selective nonresponse from subsets of participants.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

What three categories of missing data are distinguished in the lecture?

A

Missing Completely at Random (MCAR), Missing at Random (MAR), and Missing Not at Random (MNAR).

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

What is the impact of MCAR and MAR on parameter estimates and statistical power?

A

MCAR and MAR do not bias parameter estimates but reduce statistical power and precision.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

What is the impact of MNAR on parameter estimates?

A

MNAR biases estimates and is difficult to detect and correct.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

What example is given for MCAR?

A

A random programming glitch.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What example is given for MAR?

A

Older people not responding to sexuality questions.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

What characterises MNAR, as discussed in the lecture?

A

The missingness is related to the unobserved value itself.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

How should researchers respond to missing data issues according to the lecture?

A

They should not hide them but transparently report them to enhance credibility.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

What is the first practical step in identifying missingness?

A

Descriptive checks such as per-participant and per-variable missingness rates.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

What statistical test is introduced to examine the randomness of missing data?

A

Little’s MCAR test.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

What does a significant result on Little’s MCAR test suggest?

A

Systematic missingness.

17
Q

What follow-up analysis is suggested after a significant MCAR test result?

A

Separate variance t-test to determine if the probability of missing data is predicted by a specific variable.

18
Q

What limitation is noted about statistical tests for missingness?

A

They are limited to measured variables, and large samples may yield trivial significance.

19
Q

What are the two broad strategies for handling missing data discussed in the lecture?

A

Deletion and substitution.

20
Q

What is listwise deletion?

A

Removing any case with missing data from all analyses — appropriate when less than 5% of data is missing.

21
Q

What is pairwise deletion?

A

Removing cases only from specific analyses that include the missing variable.

22
Q

What is a potential issue with pairwise deletion?

A

It can result in inconsistencies across analyses.

23
Q

What is a key limitation of mean substitution?

A

It underestimates variability and inflates false positives.

24
Q

What is a flaw in regression substitution?

A

It underestimates standard errors.

25
What does Expectation Maximisation (EM) use for estimation?
Iterative estimation based on known data (e.g., means, variance, and covariances).
26
What does EM add to variance and covariance estimates?
Noise to simulate variability.
27
How does EM generate substitutions for missing data?
It generates regression equations that relate each variable to other variables and uses this to provide substitutions.
28
What is the final step in the EM procedure?
Reestimating sample statistics using completed data, repeating the process until re-estimated statistics match the original estimates.
29
How does Multiple Imputation (MI) improve upon EM?
It generates multiple complete datasets (using EM algorithm) and averages across them, preserving variability and better standard error estimates.
30
What does MI add to data based on existing error distributions?
Noise.
31
Despite its complexity, how is MI regarded in handling missing data?
It is considered the gold standard.
32
What is the implication if data are MNAR?
No statistical fix can fully resolve the issue.
33
How should MNAR cases be addressed?
Through theoretical reasoning, sensitivity analysis, and transparency about limitations.
34
What key distinctions should researchers make regarding missing data?
Distinguishing between bias and precision loss.
35
What final recommendation is made regarding reporting missing data procedures?
Researchers should clearly document all procedures and assumptions related to missing data in research reports.