Observation and Intervention Incentives in Causal Influence Diagrams: Towards an Understanding of Powerful Machine Learning Systems
- π€ Speaker: Tom Everitt (DeepMind)
- π Date & Time: Friday 25 January 2019, 11:00 - 12:00
- π Venue: Engineering Department, LR5 (in front of library, 1st floor)
Abstract
As machine learning systems gain in capability and complexity, understanding their incentives will become increasingly important. In this paper, we model their objectives and environment interaction in graphical models called influence diagrams. This allows us to answer two fundamental questions about the incentives of a machine learning system directly from the graphical representation: (1) which nodes would the system like to observe in addition to its observations, and (2) which nodes would the system like to control in addition to its actions? The answers tell us which information and influence points need extra protection, and have applications to fairness and reward tampering. For example, we may want a classifier for job applications to not use the race of the candidate, and a reinforcement learning agent not to take direct control of its reward mechanism. Different algorithms and training paradigms can lead to different influence diagrams, so our results can help designing algorithms with less problematic observation and intervention incentives.
Series This talk is part of the Machine Learning @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- Biology
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge Neuroscience Seminars
- Cambridge talks
- CBL important
- Chris Davis' list
- Creating transparent intact animal organs for high-resolution 3D deep-tissue imaging
- dh539
- dh539
- Engineering Department, LR5 (in front of library, 1st floor)
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Joint Machine Learning Seminars
- Life Science
- Life Sciences
- Machine Learning @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- Neuroscience
- Neuroscience Seminars
- Neuroscience Seminars
- ob366-ai4er
- Required lists for MLG
- rp587
- Seminar
- Simon Baker's List
- Stem Cells & Regenerative Medicine
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Friday 25 January 2019, 11:00-12:00