skip to main content
10.1145/2996913.2996982acmotherconferencesArticle/Chapter ViewAbstractPublication PagesgisConference Proceedingsconference-collections
research-article
Public Access

GCMF: an efficient end-to-end spatial join system over large polygonal datasets on GPGPU platform

Published:31 October 2016Publication History

ABSTRACT

Given two layers of large polygonal datasets, detecting those pairs of cross-layer polygons which satisfy a join predicate, such as intersection or contain, is one of the most computationally intensive primitive operations in the spatial domain applications. In this work, we introduce GCMF, an end-to-end software system, that is able to handle spatial join (with ST_Intersect operation) over non-indexed polygonal datasets with over 3 GB file size comprising more than 600, 000 polygons on a single GPU within less than 8 sec by applying innovative filter and refinement techniques. GCMF performs a two-step filtering phase. 1) A sort-based Minimum Bounding Rectangle (MBR) filtering step detects potentially overlapping polygon pairs up to 20 times faster than the optimized GEOS library routine. 2) A linear time Common MBR filtering step (based on the overlapping area of two given MBRs) that not only eliminates two-third of the candidate polygon pairs but also reduces the number of edges to be considered in the refinement phase by 40-fold on an average based on our experimental results with real datasets. Furthermore, for the refinement phase, GCMF implements a load-balanced parallel point-in-polygon and edge-intersection tests over GPU. Our experimental results with three different real datasets show up to 39-fold end-to- end speedup versus optimized sequential routines of GEOS C++ library as well as PostgreSQL spatial database with PostGIS.

References

  1. Aji, A., Teodoro, G., and Wang, F. Haggis: Turbocharge a mapreduce based spatial data warehousing system with GPU engine. In Proceedings of the 3rd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data (2014), ACM, pp. 15--20. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Arge, L., Procopiuc, O., Ramaswamy, S., Suel, T., and Vitter, J. S. Scalable sweeping-based spatial join. In VLDB (1998), vol. 98, Citeseer, pp. 570--581.Google ScholarGoogle Scholar
  3. Audet, S., Albertsson, C., Murase, M., and Asahara, A. Robust and efficient polygon overlay on parallel stream processors. In Proceedings of the 21st ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (2013), ACM, pp. 304--313. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Civilmap. Available at: https://blog.civilmaps.com/the-lidar-data-crunch/.Google ScholarGoogle Scholar
  5. García, Y. J., Lopez, M. A., and Leutenegger, S. T. On optimal node splitting for R-trees. In Proceedings of the 24rd International Conference on Very Large Data Bases (1998), Morgan Kaufmann Publishers Inc., pp. 334--344.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Jacox, E. H., and Samet, H. Spatial join techniques. ACM Transactions on Database Systems (TODS) 32, 1 (2007), 7.Google ScholarGoogle Scholar
  7. Leutenegger, S. T., Lopez, M., Edgington, J., Et Al. Str: A simple and efficient algorithm for R-tree packing. In Data Engineering, 1997. Proceedings. 13th International Conference on (1997), IEEE, pp. 497--506.Google ScholarGoogle ScholarCross RefCross Ref
  8. Lieberman, M. D., Sankaranarayanan, J., and Samet, H. A fast similarity join algorithm using graphics processing units. In Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on (2008), IEEE, pp. 1111--1120. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Luo, L., Wong, M. D., and Leong, L. Parallel implementation of R-trees on the GPU. In Design Automation Conference (ASP-DAC), 2012 17th Asia and South Pacific (2012), IEEE, pp. 353--358.Google ScholarGoogle ScholarCross RefCross Ref
  10. Mckenney, M., and Mcguire, T. A Parallel plane sweep algorithm for multi-core systems. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (2009), ACM, pp. 392--395. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Pavlovic, M., Tauheed, F., Heinis, T., and Ailamakit, A. GIPSY: joining spatial datasets with contrasting density. In Proceedings of the 25th International Conference on Scientific and Statistical Database Management (2013), ACM, p. 11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Postgis. http://postgis.net/.Google ScholarGoogle Scholar
  13. Prasad, S. K., Mcdermott, M., He, X., and Puri, S. GPU-based parallel R-tree construction and querying. In Parallel and Distributed Processing Symposium Workshop (IPDPSW), 2015 IEEE International (2015), IEEE, pp. 618--627.Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Prasad, S. K., Mcdermott, M., Puri, S., Shah, D., Aghajarian, D., Shekhar, S., and Zhou, X. a vision for GPU-accelerated parallel computation on geo-spatial datasets. SIGSPATIAL Special 6, 3 (2015), 19--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Puri, S., and Prasad, S. K. MPI-GIS: New parallel overlay algorithm and system prototype.Google ScholarGoogle Scholar
  16. Puri, S., and Prasad, S. K. A parallel algorithm for clipping polygons with improved bounds and a distributed overlay processing system using MPI. In Cluster, Cloud and Grid Computing (CCGrid), 2015 15th IEEE/ACM International Symposium on (2015), IEEE, pp. 576--585. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Shimrat, M. Algorithm 112: position of point relative to polygon. Communications of the ACM 5, 8 (1962), 434.Google ScholarGoogle Scholar
  18. Simion, B., Ray, S., and Brown, A. D. Speeding up spatial database query execution using gpus. Procedia Computer Science 9 (2012), 1870--1879. Google ScholarGoogle ScholarCross RefCross Ref
  19. Stonebraker, M., Frew, J., Gardels, K., and Meredith, J. The sequoia 2000 storage benchmark. In ACM SIGMOD Record (1993), vol. 22, ACM, pp. 2--11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Yampaka, T., and Chongstitvatana, P. Spatial join with R-tree on graphics processing units. KMUTNB: International Journal of Applied Science and Technology 5, 3 (2013), 1--7.Google ScholarGoogle Scholar
  21. You, S., Zhang, J., and Gruenwald, L. Parallel spatial query processing on gpus using R-trees. In Proceedings of the 2nd ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data (2013), ACM, pp. 23--31. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. You, S., Zhang, J., and Gruenwald, L. Scalable and efficient spatial data management on multi-core CPU and GPU clusters: A preliminary implementation based on impala. In Data Engineering Workshops (ICDEW), 2015 31st IEEE International Conference on (2015), IEEE, pp. 143--148. Google ScholarGoogle ScholarCross RefCross Ref
  23. Zhang, J., and You, S. Cudagis: report on the design and realization of a massive data parallel GIS on GPUs. In Proceedings of the Third ACM SIGSPATIAL International Workshop on GeoStreaming (2012), ACM, pp. 101--108. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Zhang, J., and You, S. Speeding up large-scale point-in-polygon test based spatial join on GPUs. In Proceedings of the 1st ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data (2012), ACM, pp. 23--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Zhang, J., you, S., and Gruenwald, L. High-performance spatial query processing on big taxi trip data using gpgpus. In Big Data (BigData Congress), 2014 IEEE International Congress on (2014), IEEE, pp. 72--79.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Zhou, X., Abel, D. J., and Truffet, D. Data partitioning for parallel spatial join processing. Geoinformatica 2, 2 (1998), 175--204. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. GCMF: an efficient end-to-end spatial join system over large polygonal datasets on GPGPU platform

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Other conferences
        SIGSPACIAL '16: Proceedings of the 24th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
        October 2016
        649 pages
        ISBN:9781450345897
        DOI:10.1145/2996913

        Copyright © 2016 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 31 October 2016

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        SIGSPACIAL '16 Paper Acceptance Rate40of216submissions,19%Overall Acceptance Rate220of1,116submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader