Strategies to facilitate access to detailed geocoding information based on synthetic data

If you have a question about this talk, please contact INI IT.

DLA - Data linkage and anonymisation

In this seminar we investigate whether generating synthetic data is a viable strategy for giving external researchers access to detailed geocoding information without compromising the confidentiality of the units in the database. The research was motivated by a recent project at the Institute for Employment Research (IAB) that linked exact geocodes to the Integrated Employment Biographies, a large administrative database containing several million records. Using these data, we evaluate how well several synthesizers handle the trade-off between preserving analytical validity and limiting the risk of disclosure. We propose strategies for scaling the synthesizers to such large files, introduce analytical validity measures for the generated data, and provide general recommendations for statistical agencies considering the synthetic data approach for disseminating detailed geographical information.

This talk is part of the Isaac Newton Institute Seminar Series.
