
Towards More Robust and Interpretable Models for Structured Prediction and Language Generation


If you have a question about this talk, please contact Amandla Mabona.

Many important tasks in natural language processing, such as machine translation and syntactic parsing, involve structured output spaces. In line with recent advances in representation learning, I will describe two ways to improve state-of-the-art neural methods in language modeling and syntactic parsing. Despite the expressive power of recurrent neural networks, I explore the hypothesis that such models can still benefit from linguistically motivated inductive biases, thereby generalizing better from a limited amount of labeled data. I will also argue for the importance of cost functions in learning, where a refined notion of cost, based on the idea of ensemble distillation, improves the performance of a strong neural dependency parser.
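
As a rough illustration of the ensemble-distillation idea mentioned above (a minimal sketch, not the speaker's implementation; the function name, the "temperature" parameter, and the "alpha" weighting are assumptions), such a cost can blend the usual hard-label cross-entropy with a soft term that pulls the parser towards an ensemble's averaged output distribution:

import torch.nn.functional as F

def distillation_loss(student_logits, gold_labels, ensemble_probs,
                      temperature=1.0, alpha=0.5):
    """Hypothetical distillation-style cost: mix hard-label cross-entropy
    with divergence from an ensemble's soft targets."""
    # Standard cross-entropy against the gold decisions.
    hard_loss = F.cross_entropy(student_logits, gold_labels)
    # KL divergence between the student's (tempered) distribution and the
    # ensemble's averaged probabilities, which encode that some wrong
    # decisions are less costly than others.
    log_q = F.log_softmax(student_logits / temperature, dim=-1)
    soft_loss = F.kl_div(log_q, ensemble_probs, reduction="batchmean")
    return alpha * hard_loss + (1.0 - alpha) * soft_loss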

In the opposite direction, neural representation learners can be thought of as “mini-linguists”, in the sense that they often need to come up with generalizations and theories about latent aspects of language. The learners’ findings can then serve as empirical evidence to confirm or refute linguistic hypotheses. To this end, I demonstrate that the findings of Recurrent Neural Network Grammars (RNNGs), a state-of-the-art model for parsing and language modeling, mostly align with established linguistic theories of syntax, while also uncovering some syntactic phenomena that differ from our intuitions but are nevertheless interesting.

BIOGRAPHY: Adhiguna Kuncoro is a DPhil student in computer science at the University of Oxford and a research scientist at DeepMind. His primary research interest lies at the intersection of natural language processing and machine learning, particularly in designing statistical models of natural language that are (i) robust, (ii) interpretable, and (iii) able to learn more from less annotated data. His co-authored work received an Outstanding Long Paper award at EACL 2017. He holds a Master’s degree in language technologies from CMU LTI, where he worked on low-resource language processing under the DARPA LORELEI project, and another Master’s degree in computer science from the University of Oxford. His DPhil studies at Oxford are jointly funded by an EPSRC studentship under the “flexibility to support the very best students” scheme, an Oxford Department of Computer Science scholarship, and a Balliol Mark Sadler scholarship. He completed his undergraduate degree in informatics engineering at Institut Teknologi Bandung, Indonesia.

This talk is part of the NLIP Seminar Series.
