DOI: 10.1145/1247480.1247646
Article

Provenance in databases

Published: 11 June 2007

ABSTRACT

The provenance of data has recently been recognized as central to the trust one places in data. It is also important to annotation, to data integration and to probabilistic databases. Three workshops have been held on the topic, and it has been the focus of several research projects and prototype systems. This tutorial will attempt to provide an overview of research in provenance in databases with a focus on recent database research and technology in this area. This tutorial is aimed at a general database research audience and at people who work with scientific data.


Supplemental Material

p1171-buneman_56k.mp4 (mp4, 86.2 MB)

p1171-buneman_768k.mp4 (mp4, 415.9 MB)

    • Published in

      SIGMOD '07: Proceedings of the 2007 ACM SIGMOD international conference on Management of data
      June 2007
      1210 pages
      ISBN: 9781595936868
      DOI: 10.1145/1247480
      • General Chairs: Lizhu Zhou, Tok Wang Ling
      • Program Chair: Beng Chin Ooi

      Copyright © 2007 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate: 785 of 4,003 submissions, 20%
