skip to main content
10.1145/1353343.1353363acmotherconferencesArticle/Chapter ViewAbstractPublication PagesedbtConference Proceedingsconference-collections
research-article
Free Access

A concurrency control protocol for parallel B-tree structures without latch-coupling for explosively growing digital content

Published:25 March 2008Publication History

ABSTRACT

While shared-nothing parallel infrastructures provide fast processing of explosively growing digital content, managing data efficiently across multiple nodes is important. The value-range partitioning method with parallel B-tree structures in a shared-nothing environment is an efficient approach for handling large amounts of data. To handle large amounts of data, it is also important to provide an efficient concurrency control protocol for the parallel B-tree. Many studies have proposed concurrency control protocols for B-trees, which use latch-coupling. None of these studies has considered that latch-coupling contains a performance bottleneck of sending of messages between processing elements (PEs) in distributed environments because latch-coupling is efficient for a B-tree on a single machine. The only protocol without latch-coupling is the B-link algorithm, but it is difficult to use the B-link algorithm directly on an entire parallel B-tree structure because it is necessary to guarantee the consistency of the side pointers. We propose a new concurrency control protocol named LCFB that requires no latch-coupling in optimistic processes. LCFB reduces the amount of communication between PEs during a B-tree traversal. To detect access path errors in the LCFB protocol caused by removal of latch-coupling, we assign boundary values to each index page. Because a page split may cause page deletion in a Fat-Btree, we also propose an effective method for handling page deletions without latch-coupling. We then combine LCFB with the B-link algorithm within each PE to reduce the cost of Structure Modification Operations (SMOs) in a PE, as a solution to the difficulty of consistency management for the side pointers in a parallel B-tree structure. To compare the performance of the proposed protocol with conventional protocols MARK-OPT, INC-OPT, and ARIES/IM, we implemented them on an autonomous disk system with a Fat-Btree structure. Experimental results in various environments indicate that the system throughput of the proposed protocols is always superior to those of the other protocols, especially in large-scale configurations, and LCFB with the B-link algorithm is effective at higher update ratios.

References

  1. R. Bayer and M. Schkolnick. Concurrency of operations on B-trees. Acta Inf., 9(1):1--21, 1977.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. P. A. Bernstein, V. Hadzilacos, and N. Goodman. Concurrency Control and Recovery in Database Systems. Addison-Wesley, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach, M. Burrows, T. Chandra, A. Fikes, and R. E. Gruber. Bigtable: A distributed storage system for structured data. In Proc. of OSDI2006, pages 205--218, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Cluster File Systems, Inc. Lustre: A scalable, high-performance file system. http://www.lustre.org/docs/whitepaper.pdf.Google ScholarGoogle Scholar
  5. R. Devine. Design and implementation of DDH: A distributed dynamic hashing algorithm. In Proc of FODO '93, pages 101--114, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. D. J. DeWitt and J. Gray. Parallel database systems: The future of high performance database systems. Commun. ACM, 35(6):85--98, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. S. Ghemawat, H. Gobioff, and S.-T. Leung. The Google file system. In Proc. of SOSP 2003, pages 29--43, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. G. Graefe. Write-optimized B-trees. In Proc. of VLDB 2004, pages 672--683, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Gray and A. Reuter. Transaction Processing: Concepts and Techniques. Morgan Kaufmann, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. I. Jaluta, S. Sippu, and E. Soisalon-Soininen. Concurrency control and recovery for balanced B-link trees. 14(2):257--277, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Johnson and D. Shasha. Utilization of B-trees with inserts, deletes and modifies. In Proc. of PODS'89, pages 235--246, 1989. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. H. T. Kung and P. L. Lehman. Concurrent manipulation of binary search trees. ACM Trans. Database Syst., 5(3):354--382, 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. V. Lanin and D. Shasha. A symmetric concurrent B-tree algorithm. In Proc. of FJCC'86, pages 380--389, 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. M. Lee, Mong-Liand Kitsuregawa, B. C. Ooi, K.-L. Tan, and A. Mondal. Towards self-tuning data placement in parallel database systems. In Proc. of the SIGMOD'00, pages 225--236, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. P. L. Lehman and S. B. Yao. Efficient locking for concurrent operations on B-trees. ACM Trans. Database Syst., 6(4):650--670, 1981. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. W. Litwin, M.-A. Neimat, and D. A. Schneider. LH* --- linear hashing for distributed files. In Proc. of the SIGMOD '93, pages 327--336, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Lomet and B. Salzberg. Concurrency and recovery for index trees. VLDB Journal, 6(3):224--240, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. J. MacCormick, N. Murphy, M. Najork, C. A. Thekkath, and L. Zhou. Boxwood: Abstractions as the foundation for storage infrastructure. In Proc. of OSDI 2004, pages 105--120, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. J. Menon, D. A. Pease, R. Rees, L. D. ich, and B. Hillsberg. IBM storage tank --- a heterogeneous scalable SAN file system. IBM Syst. J., 42(2):250--267, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Miyazaki, Y. Abe, and H. Yokota. Availabilities and costs of reliable Fat-Btrees. In Proc. of PRDC2004, pages 163--172, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. J. Miyazaki and H. Yokota. Concurrency control and performance evaluation of parallel B-tree structures. IEICE Transactions on Information and Systems, E85-D(8):1269--1283, 2002.Google ScholarGoogle Scholar
  22. C. Mohan and F. Levine. ARIES/IM: An efficient and high concurrency index management method using write-ahead logging. In Proc. of the SIGMOD'92, pages 371--381, 1992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. X. Ouyang, T. Yoshihara, and H. Yokota. An efficient commit protocol exploiting primary-backup placement in a distributed storage system. In Proc. of PRDC2006, pages 238--247, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. B. Phillips. Have storage area networks come of age? IEEE Computer, 31(7):10--12, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. N. Ponnekanti and H. Kodavalla. Online index rebuild. In Proc. of the SIGMOD'00, pages 529--538, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. SAS. SAS scalable performance data server. http://www.sas.com/technologies/dw/storage/spds/index.html.Google ScholarGoogle Scholar
  27. B. Seeger and P.-Å. Larson. Multi-disk B-trees. In Proc. of the SIGMOD '91, pages 436--445, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. V. Srinivasan and M. J. Carey. Performance of B-tree concurrency control algorithms. In Proc. of the SIGMOD '91, pages 416--425, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. H. Yokota. Autonomous disks for advanced database applications. In Proc. of DANTE'99, pages 435--442, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Yokota, Y. Kanemasa, and J. Miyazaki. Fat-Btree: An update-conscious parallel directory structure. In Proc. of ICDE'99, pages 448--457, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. T. Yoshihara, D. Kobayashi, and H. Yokota. MARK-OPT: A concurrency control protocol for parallel B-tree structures to reduce the cost of SMOs. IEICE Transactions on Information and Systems, E90-D(8):1213--1224, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. A concurrency control protocol for parallel B-tree structures without latch-coupling for explosively growing digital content

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Other conferences
          EDBT '08: Proceedings of the 11th international conference on Extending database technology: Advances in database technology
          March 2008
          762 pages
          ISBN:9781595939265
          DOI:10.1145/1353343

          Copyright © 2008 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 25 March 2008

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          Overall Acceptance Rate7of10submissions,70%

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader