skip to main content
Department of Computer Science University of Colorado Boulder
cu: home | engineering | mycuinfo | about | cu a-z | search cu | contact cu cs: about | calendar | directory | catalog | schedules | mobile | contact cs
home · events · colloquia · 2007-2008 · 

Colloquium - Cheney

ECCR 265

Language-Based Foundations for Data Provenance
Cornell University

Bioinformatics and other disciplines now rely heavily on "curated" databases that are built up by the manual effort of expert scientists. Curation yields higher-quality results than any fully automatic technique, but it is labor-intensive and costly. Because judgments of the quality of the data ultimately rest on the choices made by the database curators, it is crucial to maintain adequate provenance records showing the database's history. Currently, however, provenance is not well-supported by databases and other systems used by curators. Instead, it is maintained by manual curator effort or ad-hoc systems which do little to ensure that the provenance record is correct, complete and useful.

In my view, the key unsolved problem in this area is developing clear high-level specifications and correctness guarantees for various techniques that justify their inclusion into general-purpose systems. My approach is to adapt techniques from programming languages and semantics to the setting of databases. I will present two provenance-tracking techniques that provide strong guarantees, one based on the intuition that provenance should indicate where data in the output of a query or update "comes from" in the input, and one based on the idea that provenance should highlight all parts of the input that "explain" a part of the output.

Hosted by Amer Diwan.

The Department holds colloquia throughout the Fall and Spring semesters. These colloquia, open to the public, are typically held on Thursday afternoons, but sometimes occur at other times as well. If you would like to receive email notification of upcoming colloquia, subscribe to our Colloquia Mailing List. If you would like to schedule a colloquium, see Colloquium Scheduling.

Sign language interpreters are available upon request. Please contact Stephanie Morris at least five days prior to the colloquium.

See also:
Department of Computer Science
College of Engineering and Applied Science
University of Colorado Boulder
Boulder, CO 80309-0430 USA
Send email to

Engineering Center Office Tower
ECOT 717
FAX +1-303-492-2844
XHTML 1.0/CSS2 ©2012 Regents of the University of Colorado
Privacy · Legal · Trademarks
May 5, 2012 (13:29)