Building Reproducible Machine Learning Pipelines for Inference of Galaxy Properties at Scale
- 👤 Speaker: Gurjeet Jagwani - IoA, University of Cambridge
- 📅 Date & Time: Thursday 23 October 2025, 13:00 - 14:00
- 📍 Venue: Room E, West Hub
Abstract
Next-generation astronomical surveys like the Legacy Survey of Space and Time (LSST) will deliver billions of galaxy observations crucial for understanding dark matter and dark energy. However, extracting reliable galaxy properties such as redshifts from these data requires scalable computational approaches that can handle such enormous datasets while maintaining scientific rigour and reproducibility. We present pop-cosmos, a forward-modelling framework for photometric galaxy survey data that constrains population-level galaxy properties up to redshift 6. Galaxies are modelled as draws from a population prior over physical parameters (redshift, stellar mass, star formation history, dust properties), mapped to observed colours and brightness using neural emulators of complex astrophysical models, achieving 10,000x speedups. We use simulation-based inference to calibrate this population prior on deep multi-wavelength data (COSMOS2020), training a diffusion model to match the statistical properties of real survey data. The resulting model helps us understand and probe various astrophysical and cosmological phenomena. Central to our framework is flowfusion, a general-purpose library for density estimation and generative modelling that implements state-of-the-art machine learning methods including diffusion models and flow-matching. I will demonstrate how our open-source toolkit enables reproducible results from our scientific applications and discuss ongoing work with the Kilo-Degree Survey in preparation for LSST.
Series
This talk is part of the RSE Seminars series.