research-article

Octree-Based 3D Logic and Computation of Spatial Relationships in Live Video Query Processing

Authors:
Jun Ye

University of Central Florida

University of Central Florida
View Profile

,
Kien A. Hua

University of Central Florida

University of Central Florida
View Profile

ACM Transactions on Multimedia Computing, Communications, and Applications Volume 11 Issue 2Article No.: 28pp 1–23https://doi.org/10.1145/2645864

Published:07 January 2015Publication History

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

Live video computing (LVC) on distributed smart cameras has many important applications; and a database approach based on a Live Video DataBase Management System (LVDBMS) has shown to be effective for general LVC application development. The performance of such a database system relies on accurate interpretation of spatial relationships among objects in the live video. With the popularity of affordable depth cameras, 3D spatial computation techniques have been applied. However, the 3D object models currently used are expensive to compute, and offer limited scalability. We address this drawback in this article by proposing an octree-based 3D spatial logic and presenting algorithms for computing 3D spatial relationships using depth cameras. To support continuous query processing on live video streams, we also develop a GPU-based implementation of the proposed technique to further enhance scalability for real-time applications. Extensive performance studies based on a public RGB-D dataset as well as the LVDBMS prototype demonstrates the correctness and efficiency of our techniques.

References

Pradeep Kumar Atrey. 2009. A hierarchical model for representation of events in multimedia observation systems. In Proceedings of the 1st ACM International Workshop on Events in Multimedia. ACM, 57--64. Google ScholarDigital Library
Pradeep Kumar Atrey, Mohan S Kankanhalli, and Ramesh Jain. 2006. Information assimilation framework for event detection in multimedia surveillance systems. Multimedia Syst. 12, 3, 239--253. Google ScholarDigital Library
A. J. Aved and Kien A. Hua. 2012. A general framework for managing and processing live video data with privacy proctection. Multimedia Syst. 18, 2, 123--143.Google ScholarCross Ref
I. Bloch. 1999. Fuzzy relative position between objects in image processing: a morphological approach. IEEE Trans. Pattern Anal. Mach. Intell. 21, 7, 657--664. Google ScholarDigital Library
Isabella Bloch, Olivier Colliot, and Roberto M. Cesar Jr. 2006. On the Ternary Spatial Relation Between. IEEE Trans. Syst. Man Cybern. 36, 2, 312--327. Google ScholarDigital Library
André Borrmann, Stefanie Schraufstetter, and Ernst Rank. 2007. An octree-based implementation of directional operators in a 3D spatial query language for building information models. In Proceedings of the 24th CIB-W78 Conference on IT in Construction.Google Scholar
J. M. Coughlan and A. L. Yuille. 1999. ManhattanWorld: compass direction from a single image by Bayesian inference. In Proceedings of the 7th IEEE International Conference on Computer Vision. 941--947. Google ScholarDigital Library
M. A. Fischler and R. C. Bolles. 1981. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography. Commun. ACM 24, 6, 381--395. Google ScholarDigital Library
Klaus-Peter Gapp. 1994. From vision to language: A cognitive approach to the computation of spatial relations in 3D space. In Proceedings of the 17th Annual Conference of the Cognitive Science Society.Google Scholar
Zhaoyin Jia, Andrew Gallagher, Ashutosh Saxena, and Tsuhan Chen. 2013. 3d-based reasoning with blocks, support, and stability. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 1--8. Google ScholarDigital Library
A. Kasper, R. Jakel, and R. Dillmann. 2011. Using spatial relations of objects in real world scenes for scene structuring and scene understanding. In Proceedings of the 15th International Conference on Advanced Robotics. 421--426.Google Scholar
J. M. Keller and X.Wang. 1995. Comparison of spatial relation definitions in computer vision. In Proceedings of the 3rd International Symposium on Uncertainty Modeling and Analysis. 679--684. Google ScholarDigital Library
J. M. Keller and X. Wang. 1996. Learning spatial relationships in computer vision. In Proceedings of the 5th IEEE International Conference on Fuzzy Systems. 118--124.Google Scholar
Khronos. 2013. OpenCL. http://www.khronos.org/opencl/. (2013).Google Scholar
B. J. Kuipers and T. S. Levitt. 1988. Navigation and mapping in large-scale space. AI Mag. 9, 2. Google ScholarDigital Library
K. Lai, L. Bo, X. Ren, and D. Fox. 2011. A Large-Scale Hierarchical Multi-View RGBD Object Dataset. In Proceedings of the IEEE International Conference on Robotics and Automation.Google Scholar
Donald Meagher. 1982. Geometric modeling using octree encoding. Computer Graph. Image Process. 19, 2, 129--147.Google ScholarCross Ref
K. Miyajima and A. Ralescu. 1994. Spatial organization in 2D segmented images: representation and recognition of primitive spatial relations. Fuzzy Sets Syst. 65, 2--3, 225--236. Google ScholarDigital Library
R. Peng, A. J. Aved, and K. A. Hua. 2010. Real-time query processing on live videos in networks of distributed cameras. Int. J. Interdisciplinary Telecomm. Netw. 2, 1. Google ScholarDigital Library
Abu Saleh Md Mahfujur Rahman, M. Anwar Hossain, and Abdulmotaleb El Saddik. 2010. Spatial-geometric approach to physical mobile interaction based on accelerometer and ir sensory data fusion. ACM Trans. Multimedia Comput. Commun. Appl. 6, 4, 28:1--28:23. Google ScholarDigital Library
B. Rosman and S. Ramamoorthy. 2011. Learning spatial relationships between objects. Int. J. Rob. Res. 30, 1, 1328--1342. Google ScholarDigital Library
Nadeem Salamat and El-Hadi Zahzah. 2012. On the improvement of combined fuzzy topological and directional relations information. Pattern Recognit. 45, 4, 1559--1568. Google ScholarDigital Library
Celina Maki Takemura, Roberto M. Cesar Jr., and Isabelle Bloch. 2012. Modeling and measuring the spatial relation along: regions, contours and fuzzy sets. Pattern Recognit. 45, 2, 757--766. Google ScholarDigital Library
Yoshihiro Tashiro. 1977. On methods for generating uniform random points on the surface of a sphere. Ann. Insti. Statistical Math. 29, 1, 295--300.Google ScholarCross Ref
Jun Ye and Kien A. Hua. 2013. Exploiting depth camera for 3D spatial relationship interpretation. In Proceedings of the 4th ACM Multimedia Systems Conference (MMSys'13). 151--161. Google ScholarDigital Library
Kiwon Yun, Jean Honorio, Debaleena Chattopadhyay, Tamara L. Berg, and Dimitris Samaras. 2012. Two-person interaction detection using body-pose features and multiple instance learning. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops. 28--35.Google ScholarCross Ref

Index Terms

Octree-Based 3D Logic and Computation of Spatial Relationships in Live Video Query Processing
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Image and video acquisition
        3D imaging
2. Information systems
  1. Information systems applications
    1. Multimedia information systems
      1. Multimedia databases

Recommendations

Query processing over live and archived data streams
Read More
Video query processing in the VDBMS testbed for video database research
MMDB '03: Proceedings of the 1st ACM international workshop on Multimedia databases

The increased use of video data sets for multimedia-based applications has created a demand for strong video database support, including efficient methods for handling the content-based query and retrieval of video data. Video query processing presents ...
Read More
Maintaining Spatial Relationships in Uncertain Environments
SASO '12: Proceedings of the 2012 IEEE Sixth International Conference on Self-Adaptive and Self-Organizing Systems

Inter-related computations must sometimes maintain spatial relationships as they attempt to individually respond to changes in the environment. These changes may be related to application requirements (as in a sensor network) or resource requirements. ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Multimedia Computing, Communications, and Applications Volume 11, Issue 2
December 2014
197 pages
ISSN:1551-6857
EISSN:1551-6865
DOI:10.1145/2716635
Editor:
Ralf Steinmetz
Technische Universität Darmstadt, Germany
Issue’s Table of Contents
Copyright © 2015 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 7 January 2015
- Accepted: 1 June 2014
- Revised: 1 February 2014
- Received: 1 October 2013
Published in tomm Volume 11, Issue 2

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
3D reconstruction
Live video computing
live video database
spatial relationships
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 0
  Total Citations
  View Citations
- 172
  Total Downloads
- Downloads (Last 12 months)6
- Downloads (Last 6 weeks)1
Other Metrics
View Author Metrics
Cited By
This publication has not been cited yet

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Octree-Based 3D Logic and Computation of Spatial Relationships in Live Video Query Processing

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

Query processing over live and archived data streams

Video query processing in the VDBMS testbed for video database research

Maintaining Spatial Relationships in Uncertain Environments

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Octree-Based 3D Logic and Computation of Spatial Relationships in Live Video Query Processing

ACM Transactions on Multimedia Computing, Communications, and Applications

Abstract

References

Cited By

Index Terms

Recommendations

Query processing over live and archived data streams

Video query processing in the VDBMS testbed for video database research

Maintaining Spatial Relationships in Uncertain Environments

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media