skip to main content
Department of Computer Science University of Colorado Boulder
cu: home | engineering | mycuinfo | about | cu a-z | search cu | contact cu cs: about | calendar | directory | catalog | schedules | mobile | contact cs
home · events · colloquia · 2010-2011 · 

Colloquium - Parisien

ECCR 265

Finding Structure in the Muck: Bayesian Models of How Kids Learn to Use Verbs
University of Toronto
Chris Parisien photo

Children are fantastic data miners. In the first few years of their lives, they discover a vast amount of knowledge about their native language. This means learning not just the abstract representations that make up a language, but also learning how to generalize that knowledge to new situations -- in other words, figuring out how language is productive. Given the noise and complexity in what kids hear, this is incredibly difficult, yet still, it seems effortless. In verb learning, a lot of this generalization appears to be driven by strong regularities between form and meaning. Seeing how a certain verb has been used, kids can make a decent guess about what it means. Knowing what a verb means can suggest how to use it.

In this talk, I present a series of hierarchical Bayesian models to explain how children can acquire and generalize abstract knowledge of verbs from the language they would naturally hear. Using a large, messy corpus of child-directed speech, these models can discover a broad range of abstractions governing verb argument structure, verb classes, and alternation patterns. By simulating experimental studies in child development, I show that these complex probabilistic abstractions are robust enough to capture key generalization behaviours of children and adults. Finally, I will discuss some promising ways that the insights gained from modeling child language can benefit the development of a valuable large-scale linguistic resource, namely VerbNet.

Chris Parisien is a PhD Candidate in Computer Science at the University of Toronto, working in the Computational Linguistics group. He holds a BMath in Computer Science and Cognitive Science from the University of Waterloo and an MSc in Computer Science from Toronto. His work explores ways of using computational models to answer important questions in language development and psycholinguistics. By using nonparametric topic models to discover abstract structure in noisy, sparse corpus data, this work also considers how unsupervised learning methods can build detailed lexical resources from messy text. Chris enjoys collaborations with computer scientists, linguists, psychologists, and philosophers. He will complete his PhD in August of this year.

Hosted by Martha Palmer and James Martin.

The Department holds colloquia throughout the Fall and Spring semesters. These colloquia, open to the public, are typically held on Thursday afternoons, but sometimes occur at other times as well. If you would like to receive email notification of upcoming colloquia, subscribe to our Colloquia Mailing List. If you would like to schedule a colloquium, see Colloquium Scheduling.

Sign language interpreters are available upon request. Please contact Stephanie Morris at least five days prior to the colloquium.

See also:
Department of Computer Science
College of Engineering and Applied Science
University of Colorado Boulder
Boulder, CO 80309-0430 USA
Send email to

Engineering Center Office Tower
ECOT 717
FAX +1-303-492-2844
XHTML 1.0/CSS2 ©2012 Regents of the University of Colorado
Privacy · Legal · Trademarks
May 5, 2012 (13:29)