University of Cambridge > Talks.cam > RCEAL Tuesday Colloquia > Automatic Lexical Acquisition from the CHILDES Database

Automatic Lexical Acquisition from the CHILDES Database

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Teresa Parodi.

Empirical data regarding the syntactic complexity of children’s speech is important for theories of language acquisition. Currently much of this data is absent in the annotated versions of the CHILDES database. In this study, we show that a state-of-the-art subcategorization acquisition system (Preiss et al. 2007) can be used to extract large-scale subcategorization (frequency) information from the (i) child and (ii) child-directed speech within the CHILDES database without any domain-specific tuning. We demonstrate that the acquired information is sufficiently accurate to a) confirm previously reported research findings and b) yield completely new research findings for theoretical language acquisition research. We also report qualitative results which can be used to further improve parsing and lexical acquisition technology for child language data in the future.

This talk is part of the RCEAL Tuesday Colloquia series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity