Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Noise-Aware Differentially Private Synthetic Data

Add to your list(s) Download to your calendar using vCal

Antti Honkela, University of Helsinki
Tuesday 28 June 2022, 11:00-12:00
Hybrid, CBL Seminar room, Department of Engineering, and Zoom https://eng-cam.zoom.us/j/89002493651?pwd=B_2gKl7va_h0CQ9yoMPSbn2ifYLGi4.1.

If you have a question about this talk, please contact Dr R.E. Turner.

Synthetic data generated under differential privacy (DP) promises to significantly simplify analysis of sensitive personal data. Existing work has shown that simply analysing DP synthetic data as if it were real does not produce valid inferences of population-level quantities, leading to too narrow confidence intervals and thereby risking false discoveries. We propose using multiple imputation techniques to avoid these problems. This requires simulating multiple synthetic data sets from the Bayesian posterior predictive distribution over data sets. We propose a novel noise-aware Bayesian DP synthetic data generation mechanism for discrete data that enables generating such a distribution of data sets. Our experiments demonstrate that the method is able to produce accurate confidence intervals from DP synthetic data.

This talk is part of the Machine Learning @ CUED series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Noise-Aware Differentially Private Synthetic Data

This talk is included in these lists:

Other lists

Other talks