Log in

Cambridge users (raven) details

Other users details

No account? details

Information on

Subscribing to talks details

Finding a talk details

Adding a talk details

Disseminating talks details

Help and Documentation details

A Fast Decoder for Joint Word Segmentation and POS-Tagging using a Single Discriminative Model

Add to your list(s) Download to your calendar using vCal

Yue Zhang and Stephen Clark, University of Cambridge
Friday 01 October 2010, 12:30-13:00
SW01, Computer Laboratory.

If you have a question about this talk, please contact Thomas Lippincott.

We show that the standard beam-search algorithm can be used as an efficient decoder for the global linear model of Zhang and Clark (2008) for joint word segmentation and POS -tagging, achieving a significant speed improvement. Such decoding is enabled by: (1) separating full word features from partial word features so that feature templates can be instantiated incrementally, according to whether the current character is separated or appended; (2) deciding the POS -tag of a potential word when its first character is processed. Early-update is used with perceptron training so that the linear model gives a high score to a correct partial candidate as well as a full output. Effective scoring of partial structures allows the decoder to give high accuracy with a small beam-size of 16. In our 10-fold cross-validation experiments with the Chinese Treebank, our system performed over 10 times as fast as Zhang and Clark (2008) with little accuracy loss. The accuracy of our system on the standard CTB 5 test was competitive with the best in the literature.

This talk is part of the NLIP Seminar Series series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

Log in

Information on

A Fast Decoder for Joint Word Segmentation and POS-Tagging using a Single Discriminative Model

This talk is included in these lists:

Other lists

Other talks