Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints

Add to your list(s) Download to your calendar using vCal

Yufan Guo, University of Cambridge
Friday 17 May 2013, 12:00-13:00
FW26, Computer Laboratory.

If you have a question about this talk, please contact Ekaterina Kochmar.

Inferring the information structure of scientific documents is useful for many down-stream applications. Existing feature-based machine learning approaches to this task require substantial training data and suffer from limited performance. Our idea is to guide feature-based models with declarative domain knowledge encoded as posterior distribution constraints. We explore a rich set of discourse and lexical constraints which we incorporate through the Generalized Expectation (GE) criterion. Our constrained model improves the performance of existing fully and lightly supervised models. Even a fully unsupervised version of this model outperforms lightly supervised feature-based models, showing that our approach can be useful even when no labeled data is available.

This talk is part of the NLIP Seminar Series series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

Improved Information Structure Analysis of Scientific Documents Through Discourse and Lexical Constraints

This talk is included in these lists:

Other lists

Other talks