skip to main content
10.1145/3269206.3269208acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

BBoxDB - A Scalable Data Store for Multi-Dimensional Big Data

Published:17 October 2018Publication History

ABSTRACT

BBoxDB is a distributed and highly available key-bounding-box-value store which enhances the classical key-value data model with an axis-parallel bounding box. The bounding box describes the location of the values in an n-dimensional space, and enables BBoxDB to efficiently distribute multi-dimensional data across a cluster of nodes. Well-known geometric algorithms (such as the K-D Tree) are used to create distribution regions (multi-dimensional shards). Distribution regions are created dynamically, based on the stored data. BBoxDB stores data of multiple tables co-partitioned, which enables efficient distributed spatial joins. Spatial joins on co-partitioned tables can be executed without data shuffling between nodes. A two-level index structure is employed to retrieve stored data quickly. We demonstrate the interaction with the system, the dynamic creation of distribution regions and the data redistribution feature of BBoxDB.

References

  1. Apache HBase 2018. Website of Apache HBase. (2018). https://hbase.apache.org/ {Online; accessed 03-Mar-2018}.Google ScholarGoogle Scholar
  2. BBoxDB 2018. BBoxDB at the maven repository. (2018). https://maven-repository. com/artifact/org.bboxdb {Online; accessed 03-Mar-2018}.Google ScholarGoogle Scholar
  3. BBoxDB 2018. Website of the BBoxDB project. (2018). http://bboxdb.org {Online; accessed 03-Mar-2018}.Google ScholarGoogle Scholar
  4. J. L. Bentley. 1975. Multidimensional Binary Search Trees Used for Associative Searching. Commun. ACM 18, 9 (Sept. 1975), 509--517. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. 2007. Dynamo: Amazon's Highly Available Key-value Store. SIGOPS Oper. Syst. Rev. 41, 6 (Oct. 2007), 205--220. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Eldawy and M. F. Mokbel. 2015. SpatialHadoop: A MapReduce Framework for Spatial Data. In 31st IEEE International Conference on Data Engineering, ICDE 2015, Seoul, South Korea, April 13--17, 2015. 1352--1363.Google ScholarGoogle Scholar
  7. R. H. Guting, T. Behr, and C. D ¨ untgen. 2010. SECONDO: A Platform for Mov- ¨ ing Objects Database Research and for Publishing and Integrating Research Implementations. IEEE Data Eng. Bull. 33, 2 (2010), 56--63.Google ScholarGoogle Scholar
  8. R. H. Guting and M. Schneider. 2005. ¨ Moving Objects Databases. Morgan Kaufmann. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. A. Guzman. 1984. R-trees: A Dynamic Index Structure for Spatial Searching. SIGMOD Rec. 14, 2 (June 1984), 47--57. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. P. Hunt, M. Konar, F. P. Junqueira, and B. Reed. 2010. ZooKeeper: Wait-free Coordination for Internet-scale Systems. In Proceedings of the 2010 USENIX Conference on USENIX Annual Technical Conference (USENIXATC'10). USENIX Association, Berkeley, CA, USA, 11--25. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Lakshman and P. Malik. 2010. Cassandra: A Decentralized Structured Storage System. SIGOPS Oper. Syst. Rev. 44, 2 (April 2010), 35--40. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. G. M. Morton. 1966. A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing. International Business Machines Company.Google ScholarGoogle Scholar
  13. J. K. Nidzwetzki and R. H. Guting. 2017. Distributed Secondo: An Extensible and ¨ Scalable Database Management System. Distrib. Parallel Databases 35, 3--4 (Dec. 2017), 197--248. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Nishimura, S. Das, D. Agrawal, and A. E. Abbadi. 2011. MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware Services. In Proceedings of the 2011 IEEE 12th International Conference on Mobile Data Management - Volume 01 (MDM '11). IEEE Computer Society, Washington, DC, USA, 7--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. OpenStreetMap Project 2018. Website of the Open Street Map Project. (2018). http://www.openstreetmap.org - {Online; accessed 03-Mar-2018}.Google ScholarGoogle Scholar
  16. H. Samet. 2005. Foundations of Multidimensional and Metric Data Structures. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. BBoxDB - A Scalable Data Store for Multi-Dimensional Big Data

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader