Efficient stochastic optimal control for navigation and motor planning
- đ¤ Speaker: Bert Kappen, Radboud University Nijmegen, The Netherlands
- đ Date & Time: Wednesday 11 July 2007, 11:00 - 12:00
- đ Venue: LR6, Engineering, Department of
Abstract
This talk discusses a class of non-linear stochastic control problems that can be efficiently solved using a path integral. In this control formalism, the central concept of cost-to-go or value function becomes a free energy and methods and concepts from statistical physics can be readily applied, such as Monte Carlo sampling or the Laplace approximation. Qualitatively different optimal control strategies for different noise levels can be understood as a result of spontaneous symmetry breaking. When applied to a receding horizon problem in a stationary environment, the solution resembles the one obtained by traditional reinforcement learning with discounted reward. An advantage of the path integral control method over RL is that the control can be computed for the current state, without considering all other states and 2) that it can be easily generalized to time-dependent tasks. It is therefore a suitable approach for time dependent control. We further discuss exploration and an how agents can approximately compute their coordination using belief propagation.
Series This talk is part of the Probabilistic Systems, Information, and Inference Group Seminars series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge talks
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- Featured lists
- Information Engineering Division seminar list
- Interested Talks
- LR6, Engineering, Department of
- ndk22's list
- ob366-ai4er
- Probabilistic Systems, Information, and Inference Group Seminars
- rp587
- School of Technology
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 11 July 2007, 11:00-12:00