Data Entry Flashcards

1
Q

When collecting data with paper surveys, what data entry options might be available?

A

Manual data entry of data from paper surveys
Scanning paper surveys and having data entered remotely
Optical character recognition

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
2
Q

Survey specifications include

A
  • A well-designed, clean survey
  • Unique IDs and coding, in this case automatic coding
  • Variable values, skip patterns, logical checks etc.
How well did you know this?
1
Not at all
2
3
4
5
Perfectly
3
Q

Entering mock values to ensure responses can be entered…

What type of software check?

A

Bench testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
4
Q

Entering mock values to ensure skips work properly…

What type of software check?

A

Bench testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
5
Q

Checking that batteries have a long-enough life for field work

What type of software check?

A

Device testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
6
Q

Checking that data entered are stored accurately

What type of software check?

A

Data flow testing

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
7
Q

Why is it important to include special codes, and why in the form of: -999, -888, etc?

A

Special codes such as -999, etc, are for missing values. We want to know whether a value is missing because a respondent not knowing the answer suggests a very different interpretation from refusing to answer. Using a negatives allows us to not confuse this missing response with real data (eg age). The other options relate to unique IDs and questionnaire design.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
8
Q

Which process is used for creating digital data collection software but not for creating manual data entry software

A

Device testing (e.g. battery life, extreme conditions)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
9
Q

If the surveyor records a respondent’s response incorrectly on the questionnaire, at what point will this likely be caught in the data entry process

A

It won’t.

The double data entry process is designed to catch data entry error, not surveyor error.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
10
Q

After entering our “audit” sample, to which dataset should we compare the resulting audited data?

A

The reconciled dataset from the first two entries after correcting for errors

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
11
Q

What are the benefits of outsourcing data entry operations?

A

We would not need to invest in hardware (computer, power backups, etc)

We would not need to invest time in learning how to program software (assuming we have no prior knowledge)

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
12
Q

When conducting data entry operations in-house, we should always invest in a power backup solution (e.g. Uninterrupted Power Supply or UPS) so that, at a minimum…

A

We have enough time to save whatever data has been entered and we can safely close the program

If we need electricity for 24 hours/day or 10 hours/day, we want to select a location that has a reliable power supply, and should not rely on self-generated power, or a power backup. The power back up is primarily to ensure we do not lose data we’ve already entered.

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
13
Q

What is the main reason it is unadvisable to pay data entry operators (DEOs) per survey entered?

A

It encourages speed over quality

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
14
Q

For large scale data collection, why might we require paper surveys over digital data collection?

A

If we do not have enough time to work on digital data collection software prior to survey launch

If we are administering exams to children

If we are collecting data from within a factory that does not allow digital devices for fear of losing intellectual property

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
15
Q

Questionnaire –> 1st Entry –> 2nd Entry –> XXXXX –> Reconciliation –> Complete Dataset

Which step is missing in the data entry process?

A

Identification of discrepancies

How well did you know this?
1
Not at all
2
3
4
5
Perfectly
16
Q

When conducting double data entry, what is the main reason we want to use two separate data entry operators (DEOs)?

A

There is a lower likelihood of different DEOs making the same mistake twice

17
Q

Bench testing

A
  • In-office testing of the survey and usually involves 3 to 6 evaluators
  • Assign different scenarios to each person, considering your skip patterns
18
Q

Device Testing

A

• Test these devices thoroughly
• Battery testing for extreme scenarios
• Retest! Retest! Retest! until devices and survey are
cooperating.
– Enter and exit at different points in the survey
– Try everything to make the device malfunction
– Make any necessary corrections.

19
Q

Data flow Testing

A

• Check the flow of data
– From exporting data from devices to clean and merged data
– Should be done in conjunction with bench testing and device testing
• Inspect and examine dataset with Stata/R

20
Q

Ideal error rate when auditing surveys?

A

Error rate less than 0.5%

Reconcile results if over 0.5%