The K-FAC method for neural network optimization
- đ¤ Speaker: James Martens, Google Deep Mind
- đ Date & Time: Thursday 14 March 2019, 14:00 - 15:00
- đ Venue: Cambridge University Engineering Department, CBL, BE4-38 (http://learning.eng.cam.ac.uk/Public/Directions)
Abstract
Second order optimization methods have the potential to be much faster than first order methods in the deterministic case, or pre-asymptotically in the stochastic case. However traditional second order methods have proven ineffective or impractical for neural network training, due in part to the extremely high dimension of the parameter space. Kronecker-factored Approximate Curvature (K-FAC) is second-order optimization method based on a tractable approximation to the Gauss-Newton/Fisher matrix that exploits the special structure of neural network training objectives. This approximation is neither low-rank nor diagonal, but instead involves Kronecker-products, which allows for efficient estimation, storage and inversion of the curvature matrix. In this talk I will introduce the basic K-FAC method for standard MLPs and then present some more recent work in this direction, including extensions to CNNs and RNNs, both of which requires new approximations to the Fisher. For these I will provide theoretically motivated arguments, as well as empirical results which speak to their efficacy in neural network optimization.
Series This talk is part of the Computational Neuroscience series.
Included in Lists
- All Talks (aka the CURE list)
- Biology
- Biology
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge Neuroscience Seminars
- CamBridgeSens
- Cambridge talks
- Cambridge University Engineering Department, CBL, BE4-38 (http://learning.eng.cam.ac.uk/Public/Directions)
- CBL important
- Chris Davis' list
- Computational and Biological Learning Seminar Series
- Computational Neuroscience
- Creating transparent intact animal organs for high-resolution 3D deep-tissue imaging
- custom
- dh539
- dh539
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group Journal Clubs
- Inference Group Summary
- Information Engineering Division seminar list
- Interested Talks
- Joint Machine Learning Seminars
- Life Science
- Life Science Interface Seminars
- Life Sciences
- Life Sciences
- Machine Learning @ CUED
- Machine Learning Summary
- ME Seminar
- ML
- my_list
- ndk22's list
- Neuroscience
- Neuroscience Seminars
- Neuroscience Seminars
- ob366-ai4er
- other talks
- Quantum Matter Journal Club
- Required lists for MLG
- rp587
- se456's list
- Seminar
- Simon Baker's List
- Stem Cells & Regenerative Medicine
- TQS Journal Clubs
- Trust & Technology Initiative - interesting events
- yk373's list
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)

James Martens, Google Deep Mind
Thursday 14 March 2019, 14:00-15:00