12. Data Linkage & Integration Flashcards
1
Q
what are the ethical needs of data linkage
A
- multi-site ethics applications are needed, detailing the requirements
- data may be shared with other parties
- risk of data leaks
- data may need to be decrypted for linkage
2
Q
what are the technical needs of data linkage
A
data consistency & accessibility etc.
3
Q
what is the linkage process
A
splitting -> linkage -> integration -> research
- splitting: separate the dataset into ID & content; both files are stored & handled separately
- linkage: multiple matching methods are applied to identify/distinguish individuals; records are given an arbitrary but unique ID number; an encrypted version of the person number is used to derive a project specific person number (PPN)
- integration: join the PPN to content so one individual has multiple content matches
- research: researchers can now combine records for an individual. they won't see ID info, only the PPN
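a minimal sketch of the steps above, under loud assumptions: the records, field names, the exact-match rule in `link` and the project key are all hypothetical, and a keyed hash (HMAC) stands in for "encrypted version of person number". it only illustrates the separation of ID and content, not any real linkage unit's method.

```python
import hashlib
import hmac

# Hypothetical source records: identifying fields mixed with content fields.
records = [
    {"record_id": "r1", "name": "Ann Lee", "dob": "1980-02-01", "diagnosis": "asthma"},
    {"record_id": "r2", "name": "Ann Lee", "dob": "1980-02-01", "diagnosis": "fracture"},
    {"record_id": "r3", "name": "Bob Ray", "dob": "1975-07-12", "diagnosis": "diabetes"},
]

ID_FIELDS = ("name", "dob")  # assumed identifying fields


def split(rows):
    """Splitting: separate each record into an ID part and a content part."""
    id_file = [{"record_id": r["record_id"], **{f: r[f] for f in ID_FIELDS}} for r in rows]
    content_file = [{k: v for k, v in r.items() if k not in ID_FIELDS} for r in rows]
    return id_file, content_file


def link(id_file):
    """Linkage: map each record ID to a person number.

    Exact matching on name + dob is a stand-in for the multiple
    matching methods mentioned in the card.
    """
    person_numbers = {}
    linkage = {}
    for row in id_file:
        key = (row["name"].lower(), row["dob"])
        person_numbers.setdefault(key, len(person_numbers) + 1)
        linkage[row["record_id"]] = person_numbers[key]
    return linkage


def to_ppn(person_number, project_key):
    """Derive a project specific person number (PPN) via a keyed hash (assumption)."""
    digest = hmac.new(project_key, str(person_number).encode(), hashlib.sha256)
    return digest.hexdigest()[:12]


def integrate(content_file, linkage, project_key):
    """Integration: attach the PPN to content rows; no ID fields are carried over."""
    out = []
    for row in content_file:
        ppn = to_ppn(linkage[row["record_id"]], project_key)
        content = {k: v for k, v in row.items() if k != "record_id"}
        out.append({"ppn": ppn, **content})
    return out


id_file, content_file = split(records)
linkage = link(id_file)
research_data = integrate(content_file, linkage, b"project-A-secret")
```

the researcher only ever receives `research_data`: rows r1 and r2 share one PPN, so they can be combined for that individual, while name and dob never leave the ID file. a different project key yields different PPNs for the same people.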
4
Q
what is the importance of hc metadata
A
helps to find, authenticate, understand, trust, use & manage info
5
Q
illustrate the choice maker example
A
choice maker has 3 options: match, differ & hold
match = exact match between both records, phonetic match, match if a few letters are transposed, or the values are nicknames for each other
differ = the values are completely different
hold = one/both of the values are missing/invalid/placeholder
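a toy sketch of the three outcomes for a single name field. everything here is an assumption for illustration: the `compare` function, the nickname table, the placeholder list, the use of Soundex as the phonetic check, and treating "transposed" as one swapped pair of adjacent letters. it is not the actual choice maker software or its decision logic.

```python
# Standard Soundex letter groups (used here as the phonetic check).
SOUNDEX_CODES = {
    letter: digit
    for digit, letters in {
        "1": "bfpv", "2": "cgjkqsxz", "3": "dt", "4": "l", "5": "mn", "6": "r",
    }.items()
    for letter in letters
}

NICKNAMES = {"bill": "william", "bob": "robert", "liz": "elizabeth"}  # hypothetical table
PLACEHOLDERS = {"", "unknown", "n/a", "xxx"}  # hypothetical missing/invalid markers


def soundex(name):
    """Return the 4-character Soundex code of a name ('' if it has no letters)."""
    name = "".join(ch for ch in name.lower() if ch.isalpha())
    if not name:
        return ""
    code = name[0].upper()
    prev = SOUNDEX_CODES.get(name[0], "")
    for ch in name[1:]:
        digit = SOUNDEX_CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        if ch not in "hw":  # h and w do not separate letters with the same code
            prev = digit
    return (code + "000")[:4]


def is_transposition(a, b):
    """True if b is a with exactly one pair of adjacent letters swapped."""
    if len(a) != len(b) or a == b:
        return False
    diffs = [i for i in range(len(a)) if a[i] != b[i]]
    return (
        len(diffs) == 2
        and diffs[1] == diffs[0] + 1
        and a[diffs[0]] == b[diffs[1]]
        and a[diffs[1]] == b[diffs[0]]
    )


def compare(a, b):
    """Return 'match', 'differ' or 'hold' for two values of the same field."""
    a = (a or "").strip().lower()
    b = (b or "").strip().lower()
    if a in PLACEHOLDERS or b in PLACEHOLDERS:
        return "hold"  # one/both values missing, invalid or placeholder
    if (
        a == b
        or is_transposition(a, b)
        or NICKNAMES.get(a, a) == NICKNAMES.get(b, b)
        or soundex(a) == soundex(b)
    ):
        return "match"
    return "differ"
```

e.g. `compare("Smith", "Smyth")` is a phonetic match, `compare("Bill", "William")` is a nickname match, `compare("Smith", "Jones")` differs, and `compare("Smith", "unknown")` is held. note this sketch calls anything that is not a match "differ", which is looser than "completely different" in the card.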