City Research Online

Privacy-Preserving Record Linkage and Privacy-Preserving Blocking for Large Files with Cryptographic Keys using Multibit Trees

Schnell, R. (2013). Privacy-Preserving Record Linkage and Privacy-Preserving Blocking for Large Files with Cryptographic Keys using Multibit Trees. Paper presented at the Joint Statistical Meeting, 3-8 Aug 2013, Montreal, Canada.

Abstract

Increasingly, administrative data is being used for statistical purposes, for example registry based census taking. In practice, this usually requires linking separate files containing information on the same unit, without revealing the identity of the unit. If the linkage has to be done without a unique identification number, it is necessary to compare keys which are derived from unit identifiers and which are assumed to be similar. When dealing with large files like census data or population reg- istries, comparing each possible pair of keys of two files is impossible. Therefore, special algorithms (blocking methods) have to be used to reduce the number of comparisons needed. If the identifiers have to be encrypted due to privacy concerns, the number of available algorithms for blocking is very limited. This paper describes the adoption of a recently introduced algorithm for this problem and its performance for large files.

Publication Type: Conference or Workshop Item (Paper)
Publisher Keywords: Record Linkage; Privacy Preserving Record Linkage; Entity Resoultion; PPRL; Blocking
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: School of Arts & Social Sciences > Sociology
URI: http://openaccess.city.ac.uk/id/eprint/14431
[img]
Preview
Text - Published Version
Download (262kB) | Preview

Export

Downloads

Downloads per month over past year

View more statistics

Actions (login required)

Admin Login Admin Login