University of Cambridge > Talks.cam > Engineering Safe AI > Reinforcement learning with a corrupted reward function

Log in

University Account

External (via Google)

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Reinforcement learning with a corrupted reward function

Download to your calendar using vCal

Tom McGrath, Imperial College London
Wednesday 29 November 2017, 17:00-18:30
Cambridge University Engineering Department, CBL Seminar room BE4-38. For directions see http://learning.eng.cam.ac.uk/Public/Directions.

If you have a question about this talk, please contact Adrià Garriga Alonso .

No real-world reward function is perfect. Sensory errors and software bugs may result in RL agents observing higher (or lower) rewards than they should. For example, a reinforcement learning agent may prefer states where a sensory error gives it the maximum reward, but where the true reward is actually small. Two ways around the problem are investigated.

This talk is part of the Engineering Safe AI series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Reinforcement learning with a corrupted reward function

This talk is included in these lists:

Reinforcement learning with a corrupted reward function

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Reinforcement learning with a corrupted reward function

This talk is included in these lists:

Other lists

Other talks

Reinforcement learning with a corrupted reward function

Abstract

Included in Lists