Influence Functions
- π€ Speaker: Adrian Goldwaser, Bruno Mlodozeniec, Runa Eschenhagen, University of Cambridge
- π Date & Time: Wednesday 05 March 2025, 11:00 - 12:30
- π Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38.
Abstract
When attempting to understand the behaviour of a machine learning model, a common question is: how did the training examples contribute to a model output? Which examples contributed the most? This can also be framed as a counterfactual question: how would the final model outputs change upon removal of some examples from the training set? The goal of training data attribution (TDA) methods like influence functions, which will be the subject of this talk, is to answer precisely this question. In this talk, we will give an introduction to influence functions, discuss challenges and approaches to scalability, and give examples of practical applications. We will show that solving the aforementioned data attribution problem can be extremely useful. It can help identify pernicious data β from mislabelled examples, data responsible for undesirable behaviours (e.g. profanity or explicit content) through to data poisoning attacks. Influence functions can help understand memorisation in neural networks, providing mitigations to privacy and copyright concerns, along with fair data valuation. Influence functions can answer the above TDA problem efficiently without retraining, using only the local information about the training loss function around the final model parameters. They have been successfully used for these tasks for models ranging from 50 billion parameter Large Language Models to modern diffusion models.
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department, CBL Seminar room BE4-38.
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Adrian Goldwaser, Bruno Mlodozeniec, Runa Eschenhagen, University of Cambridge
Wednesday 05 March 2025, 11:00-12:30