Learning to Generate Natural Source Code
- đ¤ Speaker: Daniel Tarlow, Microsoft Research Cambridge
- đ Date & Time: Friday 15 November 2013, 12:00 - 13:00
- đ Venue: FW26, Computer Laboratory
Abstract
Natural source code is source code that is written by and meant to be understood by humans. I’ll talk about recent efforts to build generative models that (a) capture the structure present in source code, and (b) can be learned efficiently from large repositories of existing code. Our approach builds upon the fast training of neural probabilistic language models work of Mnih & Teh (2012), but incorporates hierarchical structure and much additional source code-specific structure. Empirically, our new models substantially outperform existing language models in terms of log probability of held out data, and samples from the learned models look more like real source code.
Series This talk is part of the NLIP Seminar Series series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Friday 15 November 2013, 12:00-13:00