Pruning and grafting syntactic trees for cross-lingual transfer tasks
- Speaker: Edoardo Ponti, TAL, University of Cambridge
- Date & Time: Friday 09 February 2018, 12:00 - 13:00
- Venue: FW26, Computer Laboratory
Abstract
Universal Dependencies is a framework for annotating syntactic trees consistently across languages to facilitate multilingual NLP and cross-lingual transfer. However, the trees of equivalent sentences might assume non-overlapping shapes because of inherent typological variation. In particular, this anisomorphism is driven by variation in 1) morphological assets and 2) clause-level constructions (such as polar questions, predicative possession, and relative clauses). In this work, we demonstrate that reducing the level of anisomorphism yields consistent gains for cross-lingual transfer tasks. First, we show how measuring anisomorphism improves the selection of the source language in Dependency Parsing transfer. Second, we put forth a method, inspired by typological documentation, to preprocess source trees so that their shapes match those of the target trees. This yields improvements in the BLEU scores of syntax-based Neural Machine Translation from Arabic to Dutch and from Indonesian to Portuguese; we release these new datasets together with the code. Our results indicate that the compatibility of the shapes of syntactic trees is crucial for source selection and for boosting cross-lingual transfer.
Series
This talk is part of the NLIP Seminar Series.
Included in Lists
- All Talks (aka the CURE list)
- bld31
- Cambridge Centre for Data-Driven Discovery (C2D3)
- Cambridge Forum of Science and Humanities
- Cambridge Language Sciences
- Cambridge talks
- Chris Davis' list
- Computer Education Research
- Computing Education Research
- Department of Computer Science and Technology talks and seminars
- FW26, Computer Laboratory
- Graduate-Seminars
- Guy Emerson's list
- Interested Talks
- Language Sciences for Graduate Students
- ndk22's list
- NLIP Seminar Series
- ob366-ai4er
- PMRFPS's
- rp587
- School of Technology
- Simon Baker's List
- Trust & Technology Initiative - interesting events
- yk449
Note: Ex-directory lists are not shown.