Department of Computer Science University of Colorado Boulder

Colloquium - Lv

ECCR 265

Similarity Search for Large-Scale Feature-Rich Data
Qin (Christine) Lv, Stony Brook University

Content-based similarity search for feature-rich data (such as digital photos, audio, video, and scientific sensor data) is a difficult problem because the data are high-dimensional and usually massive in volume. The main challenge is to deliver high-quality search results at high speed with low space usage.

This talk presents several techniques for building efficient similarity search systems for large-scale feature-rich data. The first is a sketch construction algorithm for compact metadata representation, which can typically reduce the metadata size by an order of magnitude with minimal impact on search quality. The second is a multi-probe locality-sensitive hashing (LSH) technique for indexing high-dimensional data, which substantially improves upon previous methods in both space and time efficiency. We have also developed Ferret, a general-purpose toolkit for building efficient similarity search systems. The Ferret toolkit has been used successfully to build similarity search systems for digital images, speech recordings, video, 3D shape models, and microarray gene expression data.
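For readers unfamiliar with the idea, the following is a minimal, illustrative sketch of multi-probe lookup in an LSH index. It is not the algorithm presented in the talk and it is not Ferret code. It assumes a simple sign-random-projection hash and probes only the buckets whose key differs from the query's key in one bit. All function names and parameters (`make_hyperplanes`, `probe_keys`, `num_bits`, and so on) are hypothetical and chosen for illustration.

```python
import random


def make_hyperplanes(num_bits, dim, seed=0):
    """Draw num_bits random Gaussian hyperplanes (assumed hash family)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(num_bits)]


def hash_key(vec, planes):
    """Sign-random-projection hash: one bit per hyperplane."""
    return tuple(
        1 if sum(p * x for p, x in zip(plane, vec)) >= 0 else 0
        for plane in planes
    )


def build_index(points, planes):
    """Map each bucket key to the list of point indices stored in it."""
    index = {}
    for i, vec in enumerate(points):
        index.setdefault(hash_key(vec, planes), []).append(i)
    return index


def probe_keys(key):
    """Yield the home bucket, then every bucket at Hamming distance 1."""
    yield key
    for i in range(len(key)):
        yield key[:i] + (1 - key[i],) + key[i + 1:]


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def query(q, points, index, planes, k=1, multi_probe=True):
    """Collect candidates from the probed buckets and rank them exactly."""
    home = hash_key(q, planes)
    keys = probe_keys(home) if multi_probe else [home]
    candidates = set()
    for key in keys:
        candidates.update(index.get(key, []))
    ranked = sorted(candidates, key=lambda i: squared_distance(q, points[i]))
    return ranked[:k]


# Toy data: 200 random 16-dimensional points, hashed to 8-bit keys.
rng = random.Random(1)
points = [[rng.gauss(0.0, 1.0) for _ in range(16)] for _ in range(200)]
planes = make_hyperplanes(num_bits=8, dim=16)
index = build_index(points, planes)

print(query(points[7], points, index, planes, k=3))
```

Querying with a stored point returns that point's own index first, since its distance to itself is zero. Probing neighboring buckets in a single table is what lets a multi-probe scheme examine more candidates without building additional hash tables, which is the space saving the abstract refers to.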

Qin (Christine) Lv is an Assistant Professor in the Computer Science Department at Stony Brook University (SUNY). She received her BE degree from Tsinghua University in 2000 and her PhD in Computer Science from Princeton University in 2006. Lv's primary research interest is developing efficient systems for managing and exploring massive amounts of digital data. Rooted in systems, her research also draws on algorithm design, data mining, and machine learning, as well as specific application domains such as multimedia, bioinformatics, healthcare, and scientific computing.

The Department holds colloquia throughout the Fall and Spring semesters. These colloquia, open to the public, are typically held on Thursday afternoons, but sometimes occur at other times as well. If you would like to receive email notification of upcoming colloquia, subscribe to our Colloquia Mailing List. If you would like to schedule a colloquium, see Colloquium Scheduling.

Sign language interpreters are available upon request. Please contact Stephanie Morris at least five days prior to the colloquium.

May 5, 2012 (13:29)