Deep learning as optimal control problems and Riemannian discrete gradient descent.
- đ¤ Speaker: Elena Celledoni (Norwegian University of Science and Technology; Norwegian University of Science and Technology)
- đ Date & Time: Thursday 21 November 2019, 15:05 - 15:45
- đ Venue: Seminar Room 2, Newton Institute
Abstract
We consider recent work where deep learning neural networks have been interpreted as discretisations of an optimal control problem subject to an ordinary differential equation constraint. We review the first order conditions for optimality, and the conditions ensuring optimality after discretisation. This leads to a class of algorithms for solving the discrete optimal control problem which guarantee that the corresponding discrete necessary conditions for optimality are fulfilled. The differential equation setting lends itself to learning additional parameters such as the time discretisation. We explore this extension alongside natural constraints (e.g. time steps lie in a simplex). We compare these deep learning algorithms numerically in terms of induced flow and generalisation ability. References - M Benning, E Celledoni, MJ Ehrhardt, B Owren, CB Schönlieb, Deep learning as optimal control problems: models and numerical methods, JCD.
Series This talk is part of the Isaac Newton Institute Seminar Series series.
Included in Lists
- All CMS events
- bld31
- dh539
- Featured lists
- INI info aggregator
- Isaac Newton Institute Seminar Series
- School of Physical Sciences
- Seminar Room 2, Newton Institute
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

Elena Celledoni (Norwegian University of Science and Technology; Norwegian University of Science and Technology)
Thursday 21 November 2019, 15:05-15:45