AI Safety Gridworlds: Is my agent 'safe'?
- Speaker: Jessica Yung (University of Cambridge)
- Date & Time: Wednesday 28 February 2018, 17:00 - 18:30
- Venue: Cambridge University Engineering Department, CBL Seminar room BE4-38. For directions see http://learning.eng.cam.ac.uk/Public/Directions
Abstract
AI Safety Gridworlds are a suite of 2D reinforcement learning environments that test for desirable safety properties of an agent, such as correct objective specification and robustness.
We will first discuss the paper's approach to formalising safety properties in environments. Next, we will demo some of the environments and discuss whether they are reasonable tests of desirable properties. Finally, we will discuss why certain algorithms (among variations of Rainbow and A2C) seem to have better safety performance than others.
This week’s talk is a good opportunity to get a big-picture view of AI safety from a practical perspective. No prior knowledge of AI safety is needed.
You can try the environments for yourself by cloning this git repo: https://github.com/deepmind/ai-safety-gridworlds/tree/master/ai_safety_gridworlds
Paper: AI Safety Gridworlds (Leike et al., 2017) https://arxiv.org/abs/1711.09883
Series
This talk is part of the Engineering Safe AI series.