ABSTRACT
BBoxDB is a distributed and highly available key-bounding-box-value store which enhances the classical key-value data model with an axis-parallel bounding box. The bounding box describes the location of the values in an n-dimensional space, and enables BBoxDB to efficiently distribute multi-dimensional data across a cluster of nodes. Well-known geometric algorithms (such as the K-D Tree) are used to create distribution regions (multi-dimensional shards). Distribution regions are created dynamically, based on the stored data. BBoxDB stores data of multiple tables co-partitioned, which enables efficient distributed spatial joins. Spatial joins on co-partitioned tables can be executed without data shuffling between nodes. A two-level index structure is employed to retrieve stored data quickly. We demonstrate the interaction with the system, the dynamic creation of distribution regions and the data redistribution feature of BBoxDB.
- Apache HBase 2018. Website of Apache HBase. (2018). https://hbase.apache.org/ {Online; accessed 03-Mar-2018}.Google Scholar
- BBoxDB 2018. BBoxDB at the maven repository. (2018). https://maven-repository. com/artifact/org.bboxdb {Online; accessed 03-Mar-2018}.Google Scholar
- BBoxDB 2018. Website of the BBoxDB project. (2018). http://bboxdb.org {Online; accessed 03-Mar-2018}.Google Scholar
- J. L. Bentley. 1975. Multidimensional Binary Search Trees Used for Associative Searching. Commun. ACM 18, 9 (Sept. 1975), 509--517. Google ScholarDigital Library
- G. DeCandia, D. Hastorun, M. Jampani, G. Kakulapati, A. Lakshman, A. Pilchin, S. Sivasubramanian, P. Vosshall, and W. Vogels. 2007. Dynamo: Amazon's Highly Available Key-value Store. SIGOPS Oper. Syst. Rev. 41, 6 (Oct. 2007), 205--220. Google ScholarDigital Library
- A. Eldawy and M. F. Mokbel. 2015. SpatialHadoop: A MapReduce Framework for Spatial Data. In 31st IEEE International Conference on Data Engineering, ICDE 2015, Seoul, South Korea, April 13--17, 2015. 1352--1363.Google Scholar
- R. H. Guting, T. Behr, and C. D ¨ untgen. 2010. SECONDO: A Platform for Mov- ¨ ing Objects Database Research and for Publishing and Integrating Research Implementations. IEEE Data Eng. Bull. 33, 2 (2010), 56--63.Google Scholar
- R. H. Guting and M. Schneider. 2005. ¨ Moving Objects Databases. Morgan Kaufmann. Google ScholarDigital Library
- A. Guzman. 1984. R-trees: A Dynamic Index Structure for Spatial Searching. SIGMOD Rec. 14, 2 (June 1984), 47--57. Google ScholarDigital Library
- P. Hunt, M. Konar, F. P. Junqueira, and B. Reed. 2010. ZooKeeper: Wait-free Coordination for Internet-scale Systems. In Proceedings of the 2010 USENIX Conference on USENIX Annual Technical Conference (USENIXATC'10). USENIX Association, Berkeley, CA, USA, 11--25. Google ScholarDigital Library
- A. Lakshman and P. Malik. 2010. Cassandra: A Decentralized Structured Storage System. SIGOPS Oper. Syst. Rev. 44, 2 (April 2010), 35--40. Google ScholarDigital Library
- G. M. Morton. 1966. A Computer Oriented Geodetic Data Base and a New Technique in File Sequencing. International Business Machines Company.Google Scholar
- J. K. Nidzwetzki and R. H. Guting. 2017. Distributed Secondo: An Extensible and ¨ Scalable Database Management System. Distrib. Parallel Databases 35, 3--4 (Dec. 2017), 197--248. Google ScholarDigital Library
- S. Nishimura, S. Das, D. Agrawal, and A. E. Abbadi. 2011. MD-HBase: A Scalable Multi-dimensional Data Infrastructure for Location Aware Services. In Proceedings of the 2011 IEEE 12th International Conference on Mobile Data Management - Volume 01 (MDM '11). IEEE Computer Society, Washington, DC, USA, 7--16. Google ScholarDigital Library
- OpenStreetMap Project 2018. Website of the Open Street Map Project. (2018). http://www.openstreetmap.org - {Online; accessed 03-Mar-2018}.Google Scholar
- H. Samet. 2005. Foundations of Multidimensional and Metric Data Structures. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA. Google ScholarDigital Library
Index Terms
- BBoxDB - A Scalable Data Store for Multi-Dimensional Big Data
Recommendations
BBoxDB: a distributed and highly available key-bounding-box-value store
AbstractBBoxDB is a distributed and highly available key-bounding-box-value store, which is designed to handle multi-dimensional big data. To handle large amounts of data, the software splits the stored data into multi-dimensional shards and spreads them ...
Continuous query processing in data streams using duality of data and queries
SIGMOD '06: Proceedings of the 2006 ACM SIGMOD international conference on Management of dataRecent data stream systems such as TelegraphCQ have employed the well-known property of duality between data and queries. In these systems, query processing methods are classified into two dual categories -- data-initiative and query-initiative -- ...
Comments