Natural gradient in deep neural networks
- đ¤ Speaker: Alberto Bernacchia (University of Cambridge)
- đ Date & Time: Wednesday 21 November 2018, 14:00 - 15:30
- đ Venue: Engineering Department, CBL Room 438
Abstract
We introduce the natural gradient method for stochastic optimization, and discuss whether and how this method could be applied to deep neural networks. We motivate the natural gradient by showing that the performance of stochastic gradient descent depends heavily on the choice of parameters, and it does not take into account the information geometry of the model. We show that this geometry is described by the Fisher information metric, and the steepest descent in the loss function is realized by the natural gradient, which is invariant to changes in parameters. We connect natural gradient with second-order optimization methods and discuss possible applications to deep neural networks. In particular, we present K-FAC, a specific method based on approximating the inverse Fisher information matrix as Kronecker-factorized blocks and independent layers. This allows connecting a variety of different methods under a unified framework (e.g. adaptive gradients, batch normalization, whitening). We describe applications of K-FAC to both standard and convolutional neural networks, and compare with state-of-art methods.
Series This talk is part of the Machine Learning Reading Group @ CUED series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Cambridge University Engineering Department Talks
- Centre for Smart Infrastructure & Construction
- Chris Davis' list
- Computational Continuum Mechanics Group Seminars
- custom
- Engineering Department, CBL Room 438
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Machine Learning Reading Group
- Machine Learning Reading Group @ CUED
- Machine Learning Summary
- ML
- ndk22's list
- ob366-ai4er
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- School of Technology
- Simon Baker's List
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 21 November 2018, 14:00-15:30