University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

Log in

University Account

External (via Google)

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

Download to your calendar using vCal

Huyen Pham (Université de Paris)
Wednesday 20 April 2022, 09:00-10:00
Seminar Room 1, Newton Institute.

If you have a question about this talk, please contact nobody.

FD2W03 - Optimal control and fractional dynamics

We develop an exhaustive study of Markov decision process (MDP) under mean field interaction both on states and actions in the presence of common noise, and when optimization is performed over open-loop controls on infinite horizon. We highlight the crucial role of relaxed controls for this class of models, called CMKV -MDP for conditional McKean-Vlasov MDP , with respect to classical MDP theory. We prove the correspondence between CMKV -MDP and a general lifted MDP on the space of probability measures, and establish the dynamic programming Bellman fixed point equation satisfied by the value function, as well as the existence of ε-optimal randomized feedback controls. We obtain the propagation of chaos of the optimal value functions of the N-agent MDP to the CMKVMDP when N → +∞, with some convergence rate, denoted by O(MNγ ). We finally provide examples of application of the propagation of chaos result, by approximately solving several toy models for N-agent targeted advertising problem with social influence via the resolution of the associated CMKV -MDP. Based on joint work with Médéric Motte (LPSM).

This talk is part of the Isaac Newton Institute Seminar Series series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

📅 Download to calendar (vCal)

⚠️ Important: FD2W03 - Optimal control and fractional dynamics

👤 Speaker: Huyen Pham (Université de Paris)
📅 Date & Time: Wednesday 20 April 2022, 09:00 - 10:00
📍 Venue: Seminar Room 1, Newton Institute

Questions? Contact the organiser

Abstract

Series This talk is part of the Isaac Newton Institute Seminar Series series.

Included in Lists

Note: Ex-directory lists are not shown.

Log in

🔐 Log In

Information on

ℹ️ Information

Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

This talk is included in these lists:

Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

Abstract

Included in Lists

Log in

🔐 Log In

Information on

ℹ️ Information

Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

This talk is included in these lists:

Other lists

Other talks

Mean-field Markov Decision process with common noise and randomized controls: convergence rate and applications to targeted advertising

Abstract

Included in Lists