skip to main content
10.1145/2588555.2593671acmconferencesArticle/Chapter ViewAbstractPublication PagesmodConference Proceedingsconference-collections
research-article

On complexity and optimization of expensive queries in complex event processing

Published:18 June 2014Publication History

ABSTRACT

Pattern queries are widely used in complex event processing (CEP) systems. Existing pattern matching techniques, however, can provide only limited performance for expensive queries in real-world applications, which may involve Kleene closure patterns, flexible event selection strategies, and events with imprecise timestamps. To support these expensive queries with high performance, we begin our study by analyzing the complexity of pattern queries, with a focus on the fundamental understanding of which features make pattern queries more expressive and at the same time more computationally expensive. This analysis allows us to identify performance bottlenecks in processing those expensive queries, and provides key insights for us to develop a series of optimizations to mitigate those bottlenecks. Microbenchmark results show superior performance of our system for expensive pattern queries while most state-of-the-art systems suffer from poor performance. A thorough case study on Hadoop cluster monitoring further demonstrates the efficiency and effectiveness of our proposed techniques.

References

  1. J. Agrawal, Y. Diao, et al. Efficient pattern matching over event streams. In SIGMOD, 147--160, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Akdere, et al. Plan-based complex event detection across distributed sources. PVLDB, 1(1):66--77, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. H. Ali, C. Gerea, et al. Microsoft cep server and online behavioral targeting. VLDB, 2(2):1558--1561, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Alur and P. Madhusudan. Visibly pushdown languages. In STOC, 202--211, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Arasu, S. Babu, et al. The CQL continuous query language: semantic foundations and query execution. VLDB J., 15(2):121--142, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. R. S. Barga, J. Goldstein, et al. Consistent streaming through time: A vision for event stream processing. In CIDR, 363--374, 2007.Google ScholarGoogle Scholar
  7. D. A. M. Barrington, N. Immerman, et al. On uniformity within nc 1. J. Comput. Syst. Sci., 41(3):274--306, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. J. Boulon, A. Konwinski, et al. Chukwa, a large-scale monitoring system. In CCA, volume 8, 2008.Google ScholarGoogle Scholar
  9. Z. Cao, C. Sutton, et al. Distributed inference and query processing for rfid tracking and monitoring. PVLDB, 4(5):326--337, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. G. Cugola and A. Margara. Tesla: a formally defined event specification language. In DEBS, 50--61, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. J. Demers, J. Gehrke, et al. Cayuga: A general purpose event monitoring system. In CIDR, 412--422, 2007.Google ScholarGoogle Scholar
  12. N. Immerman. Descriptive Complexity (Texts in Computer Science). Springer, 1999 edition, 1 1999.Google ScholarGoogle Scholar
  13. T. Johnson, et al. Monitoring regular expressions on out-of-order streams. In ICDE, 1315--1319, 2007.Google ScholarGoogle Scholar
  14. J. Kamp. Tense logic and the theory of linear order. 1968.Google ScholarGoogle Scholar
  15. S. Krishnamurthy, C. Wu, et al. On-the- y sharing for streamed aggregation. In SIGMOD, 623--634, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Liu, et al. High-performance nested cep query processing over event streams. In ICDE, 123--134, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. D. Luckham. Event Processing for Business: Organizing the Real-Time Enterprise. Wiley, 2011.Google ScholarGoogle Scholar
  18. M. Massie, B. Li, et al. Monitoring with ganglia, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. R. McNaughton and S. A. Papert. Counter-Free Automata (MIT research monograph no. 65). The MIT Press, 1971. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Y. Mei and S. Madden. Zstream: a cost-based query processor for adaptively detecting composite events. In SIGMOD, 193--206, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. B. Mozafari, et al. High-performance complex event processing over xml streams. In SIGMOD, 253--264, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. K. Ren, Y. Kwon, et al. Hadoop's adolescence. PVLDB, 6(10):853--864, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. R. Sadri, C. Zaniolo, et al. Expressing and optimizing sequence queries in database systems. ACM TODS, 29(2):282--318, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. N. P. Schultz-Müller, et al. Distributed complex event processing with query rewriting. In DEBS, 4. , 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. H. Vollmer. Introduction to circuit complexity: a uniform approach. Springer, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. D. Wang, et al. Active complex event processing over event streams. PVLDB, 4(10):634--645, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. D. Wang, E. A. Rundensteiner, et al. Probabilistic inference of object identifications for event stream analytics. In EDBT, 513--524, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. E. Wu, Y. Diao, et al. High-performance complex event processing over streams. In SIGMOD, 407--418, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. D. Yang, E. A. Rundensteiner, et al. A shared execution strategy for multiple pattern mining requests over streaming data. PVLDB, 2(1):874--885, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. H. Zhang, Y. Diao, et al. Recognizing patterns in streams with imprecise timestamps. PVLDB, 3(1):244--255, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. H. Zhang, et al. Optimizing expensive queries in complex event processing. Tech Report, http://sase.cs.umass.edu/Google ScholarGoogle Scholar

Index Terms

  1. On complexity and optimization of expensive queries in complex event processing

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGMOD '14: Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data
      June 2014
      1645 pages
      ISBN:9781450323765
      DOI:10.1145/2588555

      Copyright © 2014 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 June 2014

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SIGMOD '14 Paper Acceptance Rate107of421submissions,25%Overall Acceptance Rate785of4,003submissions,20%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader