Log on / register
Feedback | Support | My details
Open AccessTechnical advance

Privacy-preserving record linkage using Bloom filters

Rainer Schnell email, Tobias Bachteler email and Jörg Reiher email

Methodology Research Unit, Department of Social Sciences, University of Duisburg-Essen, D-47057 Duisburg, Germany

author email corresponding author email

BMC Medical Informatics and Decision Making 2009, 9:41doi:10.1186/1472-6947-9-41

Published: 25 August 2009

Abstract

Background

Combining multiple databases with disjunctive or additional information on the same person is occurring increasingly throughout research. If unique identification numbers for these individuals are not available, probabilistic record linkage is used for the identification of matching record pairs. In many applications, identifiers have to be encrypted due to privacy concerns.

Methods

A new protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers has been developed. The protocol is based on Bloom filters on q-grams of identifiers.

Results

Tests on simulated and actual databases yield linkage results comparable to non-encrypted identifiers and superior to results from phonetic encodings.

Conclusion

We proposed a protocol for privacy-preserving record linkage with encrypted identifiers allowing for errors in identifiers. Since the protocol can be easily enhanced and has a low computational burden, the protocol might be useful for many applications requiring privacy-preserving record linkage.


© 1999-2009 BioMed Central Ltd unless otherwise stated. Part of Springer Science+Business Media.