Embedded Agency
- 👤 Speaker: Adrià Garriga Alonso (University of Cambridge)
- 📅 Date & Time: Wednesday 30 January 2019, 17:00 - 19:00
- 📍 Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38
Abstract
Most current theories of agency deal only with dualistic or Cartesian agents. That is, the agent that makes the decisions is itself outside of the world that it takes decisions in, and is not affected by it.
But technically this is not the case, as the algorithm that makes decisions is implemented using something (a brain, a computer) in the world, and modifications to that something can change the algorithm.
According to some views, understanding the theory behind embedded, non-Cartesian agency, is key to solving some important problems in AI safety (provably aligned self-modification, wireheading). Others think it’s not so important. We shall learn about current attempts to build embedded agency theories and discuss how important it is to continue work in that area.
Reading list:
- Embedded Agency sequence from MIRI : https://www.lesswrong.com/s/Rm6oQRJJmhGCcLvxh/p/i3BTagvt3HbPMx6PN
Series This talk is part of the Engineering Safe AI series.
Included in Lists
- Cambridge talks
- Cambridge University Engineering Department, CBL Seminar room BE4-38
- Chris Davis' list
- Engineering Safe AI
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 30 January 2019, 17:00-19:00