Infinite Hidden Markov Models and Applications in NLP
- Speaker: Jurgen van Gael, Department of Engineering, University of Cambridge
- Date & Time: Friday 31 October 2008, 12:15 - 13:00
- Venue: SW01, Computer Laboratory
Abstract
Since its invention 40 years ago, the Hidden Markov Model (HMM) has been successfully applied to domains such as vision, biology, and natural language processing. This success is arguably due to fast methods for inference (the forward-backward algorithm) and parameter learning (EM, Variational Bayes, etc.). In the standard supervised NLP setting, the number of hidden states (sometimes called the capacity of the HMM) is chosen according to the labelled dataset used. Recent work (Goldwater & Griffiths 2007, Johnson 2007) has shown that unsupervised HMMs can be used efficiently to learn POS taggers from unlabelled data. However, the capacity used in that work is fixed in advance, which is undesirable when tackling new datasets or tasks and also restricts the knowledge that can be learned from the data.
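As a rough illustration of why inference in a finite HMM is fast, the forward pass of the forward-backward algorithm can be sketched as follows. This is a minimal sketch and not material from the talk: the function name `forward_log_likelihood` and the toy parameters are hypothetical, and it assumes discrete observations, a fixed number of states, and a non-empty observation sequence.

```python
import math


def forward_log_likelihood(init, trans, emit, obs):
    """Return log p(obs) for a discrete HMM using the scaled forward recursion.

    init[i]     : probability of starting in state i
    trans[i][j] : probability of moving from state i to state j
    emit[i][o]  : probability that state i emits symbol o
    obs         : non-empty sequence of integer symbol indices
    """
    n_states = len(init)
    # alpha[i] is proportional to p(state_t = i, obs[0..t]).
    alpha = [init[i] * emit[i][obs[0]] for i in range(n_states)]
    log_lik = 0.0
    for t in range(len(obs)):
        if t > 0:
            # Propagate one step through the transitions, then weight by the emission.
            alpha = [
                emit[j][obs[t]] * sum(alpha[i] * trans[i][j] for i in range(n_states))
                for j in range(n_states)
            ]
        # Rescale at every step to avoid numerical underflow on long sequences;
        # the log of the scaling factors accumulates to log p(obs).
        scale = sum(alpha)
        log_lik += math.log(scale)
        alpha = [a / scale for a in alpha]
    return log_lik


# Hypothetical toy model: 2 hidden states, 3 observation symbols.
toy_init = [0.6, 0.4]
toy_trans = [[0.7, 0.3], [0.4, 0.6]]
toy_emit = [[0.5, 0.4, 0.1], [0.1, 0.3, 0.6]]
toy_obs = [0, 1, 2]

print(forward_log_likelihood(toy_init, toy_trans, toy_emit, toy_obs))
```

For K states and a sequence of length T, this recursion costs on the order of T·K² operations, whereas summing over all state paths explicitly would require K^T terms. Note that K is fixed before the recursion runs, which is the restriction the rest of the abstract addresses.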
Recently, the machine learning community has turned its attention to nonparametric Bayesian methods. This framework allows us to treat the capacity of a model as a parameter that we want to learn. In this talk, I will show how these methods can be used to construct a nonparametric version of the HMM, the infinite HMM. I will then compare the infinite HMM with other HMMs in the context of part-of-speech tagging.
Series
This talk is part of the NLIP Seminar Series.