
Settling the sample complexity of online reinforcement learning


SCLW01 - Bridging Stochastic Control And Reinforcement Learning: Theories and Applications

A central issue in online reinforcement learning (RL) is data efficiency. While a number of recent works have achieved asymptotically optimal regret in online RL, the optimality of these results is guaranteed only in a “large-sample” regime, imposing an enormous burn-in cost before their algorithms can operate optimally. How to achieve minimax-optimal regret without incurring any burn-in cost has been an open problem in RL theory. We settle this problem in the context of finite-horizon inhomogeneous Markov decision processes. Specifically, we prove that a modified version of Monotonic Value Propagation (MVP) achieves a regret that matches the minimax lower bound over the entire range of sample sizes, essentially eliminating any burn-in requirement. The key technical innovation lies in the development of a new regret decomposition strategy and a novel analysis paradigm that decouples complicated statistical dependence, a long-standing challenge in the analysis of online RL in the sample-hungry regime. This is joint work with Zihan Zhang, Jason Lee and Simon Du.
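For readers wanting the shape of the guarantee: a minimal sketch in standard tabular notation, assuming S states, A actions, horizon H, and K episodes (the abstract itself does not fix notation; the form below is inferred from the well-known minimax lower bound for this setting together with the trivial HK cap):

\[
  \mathrm{Regret}(K)
  \;=\;
  \widetilde{O}\!\left( \min\left\{ \sqrt{H^{3} S A K},\; H K \right\} \right)
  \qquad \text{for all } K \ge 1,
\]

so the upper bound matches the \(\Omega\!\left(\sqrt{H^{3} S A K}\right)\) minimax lower bound, up to logarithmic factors, at every sample size rather than only for K beyond some large burn-in threshold, which is precisely the “no burn-in” property described above.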

This talk is part of the Isaac Newton Institute Seminar Series.
