University of Cambridge > Talks.cam > CUED Speech Group Seminars > Investigation of multilingual speech-to-text systems for use in spoken term detection

Investigation of multilingual speech-to-text systems for use in spoken term detection

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Rogier van Dalen.

The development of high-performance speech processing systems for low-resource languages is a challenging research area. One approach to address the lack of resources is to make use of data from multiple languages. A popular direction in recent years is to use bottleneck features or hybrid systems trained on multilingual data for speech-to-text (STT) systems. This talk presents an overview of these approaches for STT , and their performance for both speech recognition and spoken term detection. Experiments will be presented based on the IARPA Babel limited language pack corpora (10 hours/language) using 7 languages for multilingual system development and 3 held-out target languages.

This talk is part of the CUED Speech Group Seminars series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2019 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity