Missing data Flashcards

(26 cards)

1
Q

what is missing data?

A

Often don’t get information for every measure from every participants
Variables we study may contain missing values/ information
This includes error in data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

why is missing data important?

A

Missing data points could be meaningful for analysis if observed
This could potentially affect our analysis as it may conceal a meaningful value for understanding a problem
We need to identify this and determine the nature of the missing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

what are the different reasons why missing data may occur?

A

non-response
data entry errors
instrumentation issues
privacy concerns
natural causes

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

what is non-response?

A

a reason why missing data may occur if participants choose not to answer certain questions or fail to provide information

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

what is data entry errors?

A

a reason why missing data may occur if there are mistakes made by researchers during the data collection or entry process

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

what is instrumentation issues?

A

a reason why missing data may occur if there are problems with the tools or instruments used to collect data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

what is privacy concerns?

A

a reason why missing data may occur if sensitive information is omitted from a dataset to protect participants and to maintain ethics

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

what is natural causes?

A

a reason why missing data may occur if there are events beyond control e.g. technical issues, power outages, environmental factors can lead to missing data

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

what are the different types of missing data?

A

non-systematic
systematic

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

what are the types of non-systematic missing data?

A

missing completely at random (MCAR)
missing at random (MAR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

what are the different types of systematic missing data?

A

missing not at random (MNAR)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

explain what MCAR is

A

The fact data is missing is independent of the observed and unobserved value
The missing data reduces the analysable population of the study
Reduces the statistical power but does not introduce bias
Can be considered a simple random sample of the full data set

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

explain what MAR is

A

The missing data is systematically related to the observed but not the unobserved values
Can occur if probability of completion of the survey is related to their sex which is fully observed but not the severity of their depression

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

explain what MNAR is

A

Missing data is systematically related to the unobserved value
Analysis of a dataset containing MNAR data
Likely to result in biased estimates

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

why is missing data a problem?

A

Can have significant effects on statistical analyses
May lead to biased estimates
Can affect the story we try to tell about our participants and who they represent in the wider population

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

why do we report missing data?

A
  • Scientific research relies of clear reporting of approach and analysis so we need to show researchers and readers our procedure
  • Aid transparency and reproducibility
  • Help readers understand potential limitations with our statistical outputs
  • Helps to draw reasonable conclusions about our study
17
Q

what missing data do we report?

A
  • The number of participants who had no data on either measure
  • The number of participants who had no data on each measure
  • Also specify the reasons for the missing data
18
Q

why do we specify the reasons for the missing data?

A

to provide insights into whether the missingness is random or systematic

19
Q

what are some ways in which we can identify missing data?

A
  • Check within database and scan to look for the missing or erroneous value
  • Missing value analysis in SPSS
  • Visualise the data and check for the missing data
20
Q

what are some other approaches to missing data?

A

complete case analysis (CCA)
mean/median/mode imputation
multiple imputation
weighted estimation
model-based methods

21
Q

what is complete case analysis?

A
  • involves excluding cases with missing data from analysis
  • Can lead to biased results if missing data is not completely random
  • Difficult to make inferences about entire target population or subpopulations
22
Q

what is mean/median/mode imputation?

A
  • Filling missing values with the mean, median or mode of the observed values for that variable
  • Generally inappropriate as can reduce the variability of your data
  • Can affect measurements of covariances and correlations
  • No straightforward way to estimate standard errors
23
Q

what is multiple imputation?

A

creating, analysing and combing multiple complete datasets with imputed values

24
Q

what is weighted estimation?

A

using statistical models to estimate missing values

25
what is model-based methods?
using statistical models to estimate missing values
26
why are these other approaches to missing data important?
In addition to describing and reporting missing data, various methods can be used to estimate missing values The choice of method depends on the nature of the data and the study questions