Strategies to facilitate access to detailed geocoding information based on synthetic data
- Speaker: Joerg Drechsler (Institut für Arbeitsmarkt- und Berufsforschung)
- Date & Time: Thursday 01 December 2016, 15:30 - 16:30
- Venue: Seminar Room 2, Newton Institute
Abstract
In this seminar we investigate whether generating synthetic data can be a viable strategy for giving external researchers access to detailed geocoding information without compromising the confidentiality of the units included in the database. This research was motivated by a recent project at the Institute for Employment Research (IAB) that linked exact geocodes to the Integrated Employment Biographies, a large administrative database containing several million records. Based on these data, we evaluate how well several synthesizers address the trade-off between preserving analytical validity and limiting the risk of disclosure. We propose strategies for making the synthesizers scalable to such large files, introduce analytical validity measures for the generated data, and provide general recommendations for statistical agencies considering the synthetic data approach for disseminating detailed geographical information.
Series
This talk is part of the Isaac Newton Institute Seminar Series.