University of Cambridge > Talks.cam > MRC Biostatistics Unit Seminars > BSU Seminar: "Sparse Hamiltonian Flows (or: Bayesian Coresets Without all the Fuss)"

BSU Seminar: "Sparse Hamiltonian Flows (or: Bayesian Coresets Without all the Fuss)"

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Alison Quenault.

This will be a virtual seminar. FREE registration: https://us02web.zoom.us/meeting/register/tZ0rc-2vpz4qHNxdJ4Cm4hyLEn4BVOG5YcIt

Bayesian inference provides a coherent approach to learning from data and uncertainty assessment in complex, expressive statistical models. However, algorithms for performing inference have not yet caught up to the deluge of data in modern applications. One approach—-Bayesian coresets—-involves replacing the large dataset with a small, weighted, representative subset of data during inference. The coreset is designed to capture the information from the full dataset, but be much less computationally expensive to store in memory and iterate over. Although the methodology is sound in principle, efficiently constructing such a coreset in practice remains a significant challenge: current methods tend to be complicated to implement, slow, require a secondary inference step after coreset construction, and do not enable model selection. In this talk, I will introduce a new method—-sparse Hamiltonian flows—-that addresses all of these challenges. The method involves first subsampling the data uniformly, and then optimizing a Hamiltonian flow parametrized by coreset weights and including periodic momentum quasi-refreshment steps. I will present theoretical results demonstrating that the method enables an exponential compression of the dataset in representative models, and that the quasi-refreshment steps reduce the KL divergence to the target. Real and synthetic experiments demonstrate that sparse Hamiltonian flows provide accurate posterior approximations with significantly reduced runtime compared with competing dynamical-system-based inference methods.

This talk will be based on two papers that are available online as preprints: Chen et al, “Bayesian coresets via sparse Hamiltonian flows,” https://arxiv.org/abs/2203.05723 Naik et al, “Fast Bayesian coresets via subsampling and quasi-Newton refinement,” https://arxiv.org/abs/2203.09675

This talk is part of the MRC Biostatistics Unit Seminars series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2022 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity