skip to main content
research-article

Approximate Semantic Matching of Events for the Internet of Things

Published:07 August 2014Publication History
Skip Abstract Section

Abstract

Event processing follows a decoupled model of interaction in space, time, and synchronization. However, another dimension of semantic coupling also exists and poses a challenge to the scalability of event processing systems in highly semantically heterogeneous and dynamic environments such as the Internet of Things (IoT). Current state-of-the-art approaches of content-based and concept-based event systems require a significant agreement between event producers and consumers on event schema or an external conceptual model of event semantics. Thus, they do not address the semantic coupling issue. This article proposes an approach where participants only agree on a distributional statistical model of semantics represented in a corpus of text to derive semantic similarity and relatedness. It also proposes an approximate model for relaxing the semantic coupling dimension via an approximation-enabled rule language and an approximate event matcher. The model is formalized as an ensemble of semantic and top-k matchers along with a probability model for uncertainty management. The model has been empirically validated on large sets of events and subscriptions synthesized from real-world smart city and energy management systems. Experiments show that the proposed model achieves more than 95% F1Score of effectiveness and thousands of events/sec of throughput for medium degrees of approximation while not requiring users to have complete prior knowledge of event semantics. In semantically loosely-coupled environments, one approximate subscription can compensate for hundreds of exact subscriptions to cover all possibilities in environments which require complete prior knowledge of event semantics. Results indicate that approximate semantic event processing could play a promising role in the IoT middleware layer.

References

  1. Bogdan Alexe, Wang-Chiew Tan, and Yannis Velegrakis. 2008. STBenchmark: Towards a benchmark for mapping systems. Proc. VLDB Endow. 1, 1 (2008), 230--244. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Kyle Anderson, Adrian Ocneanu, Diego Benitez, Derrick Carlson, Anthony Rowe, and Mario Berges. 2012. BLUED: A fully labeled public dataset for event-based non-intrusive load monitoring research. In Proceedings of the 2nd KDD Workshop on Data Mining Applications in Sustainability (SustKDD).Google ScholarGoogle Scholar
  3. Luigi Atzori, Antonio Iera, and Giacomo Morabito. 2010. The Internet of Things: A survey. Comput. Netw. 54, 15 (2010), 2787--2805. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Zohra Bellahsene, Angela Bonifati, Fabien Duchateau, and Yannis Velegrakis. 2011. On evaluating schema matching and mapping. In Schema Matching and Mapping, Springer, Bertin, 253--291.Google ScholarGoogle Scholar
  5. Antonio Carzaniga, David S. Rosenblum, and Alexander L. Wolf. 2000. Achieving scalability and expressiveness in an Internet-scale event notification service. In Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing. ACM, 219--227. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Gianpaolo Cugola and Alessandro Margara. 2012. Processing flows of information: From data stream to complex event processing. ACM Comput. Surv. 44, 3, Article 15 (2012). Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Edward Curry, Souleiman Hasan, and Sean O'Riain. 2012. Enterprise energy management using a linked dataspace for energy intelligence. In Proceedings of Sustainable Internet and ICT for Sustainability (SustainIT). IEEE, 1--6.Google ScholarGoogle Scholar
  8. Richard Cyganiak. 2013. Rooms in the DERI building. http://lab.linkeddata.deri.ie/2010/deri-rooms.Google ScholarGoogle Scholar
  9. Hong-Hai Do, Sergey Melnik, and Erhard Rahm. 2003. Comparison of schema matching evaluations. In Proceedings of the Revised Papers from the NODe Web and Database-Related Workshop on Web, Web-Services, and Database Systems. Akmal B. Chaudhri, Mario Jeckle, Erhard Rahm, and Rainer Unland (Eds.), Lecture Notes in Computer Science, vol. 2593, Springer, Berlin Heidelberg, 221--237. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. EsperTech. 2013. Esper complex event processing engine. http://esper.codehaus.org/.Google ScholarGoogle Scholar
  11. Patrick Th. Eugster, Pascal A. Felber, Rachid Guerraoui, and Anne-Marie Kermarrec. 2003. The many faces of publish/subscribe. ACM Comput. Surv. 35, 2 (2003), 114--131. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Françoise Fabret, François Llirbat, Joao Pereira, I. Rocquencourt, and Dennis Shasha. 2000. Efficient matching for content-based publish/subscribe systems. In Proceedings of the International Conference on Cooperative Information Systems (CoopIS). Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Evgeniy Gabrilovich and Shaul Markovitch. 2007. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. In Proceedings of the 20th International Joint Conference on Artifical Intelligence (IJCAI'07). Morgan Kaufmann Publishers Inc., San Francisco, CA, 1606--1611. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Avigdor Gal. 2006. Managing uncertainty in schema matching with top-K schema mappings. In Journal on Data Semantics VI, Stefano Spaccapietra, Karl Aberer, and Philippe Cudr-Mauroux (Eds.), Lecture Notes in Computer Science, vol. 4090, Springer, Berlin Heidelberg, 90--114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Avigdor Gal. 2011. Uncertain schema matching. Synthesis Lect. Data Manage. 3, 1 (2011), 1--97.Google ScholarGoogle ScholarCross RefCross Ref
  16. Zellig S. Harris. 1954. Distributional structure. Word 10 (1954), 146--162.Google ScholarGoogle ScholarCross RefCross Ref
  17. Souleiman Hasan, Sean O'Riain, and Edward Curry. 2012. Approximate semantic matching of heterogeneous events. In Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems (DEBS'12). 252--263. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Souleiman Hasan, Sean O'Riain, and Edward Curry. 2013. Towards unified and native enrichment in event processing systems. In Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems (DEBS'13). 171--182. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Jing He, Yanchun Zhang, Guangyan Huang, and Jinli Cao. 2012. A smart Web service based on the context of things. ACM Trans. Internet Technol. 11, 3, Article 13 (2012). Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Gregor Hohpe and Bobby Woolf. 2004. Enterprise Integration Patterns: Designing, Building, and Deploying Messaging Solutions. Addison-Wesley Professional. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Katja Hose and Akrivi Vlachou. 2012. A survey of skyline processing in highly distributed environments. VLDB 21, 3 (2012), 359--384. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Danh Le-Phuoc, Minh Dao-Tran, Josiane Xavier Parreira, and Manfred Hauswirth. 2011. A native and adaptive approach for unified processing of linked streams and linked data. In Proceedings of the 10th International Semantic Web Conference (ISWC). Lecture Notes in Computer Science, vol. 7031, Springer, Berlin Heidelberg, 370--388. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Yoonkyong Lee, Mayssam Sayyadian, AnHai Doan, and Arnon S. Rosenthal. 2007. eTuner: Tuning schema matching software using synthetic scenarios. VLDB 16, 1 (2007), 97--122. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Haifeng Liu and H.-Arno Jacobsen. 2002. A-TOPSS: A publish/subscribe system supporting approximate matching. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02). VLDB Endowment, 1107--1110. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Zhen Liu, Srinivasan Parthasarathy, Anand Ranganathan, and Hao Yang. 2008. Near-optimal algorithms for shared filter evaluation in data stream systems. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 133--146. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Merriam-Webster. 2012. Merriam-Webster's Collegiate Thesaurus. http://www.dictionaryapi.com/products/api-collegiate-thesaurus.htm.Google ScholarGoogle Scholar
  27. George A. Miller. 1995. WordNet: A lexical database for English. Commun. ACM 38, 11 (1995), 39--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Gero Mühl, Ludger Fiege, and Peter Pietzuch. 2006. Distributed Event-Based Systems. Vol. 1. Springer.Google ScholarGoogle Scholar
  29. OECD. 2012. Machine-to-machine communications: Connecting billions of devices. http://www.oecd-ilibrary.org.Google ScholarGoogle Scholar
  30. Milenko Petrovic, Ioana Burcea, and Hans-Arno Jacobsen. 2003. S-ToPSS: Semantic Toronto publish/subscribe system. In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB'03). VLDB Endowment, 1101--1104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. Luis Sanchez, José Antonio Galache, Veronica Gutierrez, J. M. Hernandez, J. Bernat, Alex Gluhak, and Tomás Garcia. 2011. SmartSantander: The meeting point between future Internet research and experimentation and the smart cities. In Proceedings of the Future Network & Mobile Summit (FutureNetw). IEEE, 1--8.Google ScholarGoogle Scholar
  32. Jinling Wang, Beihong Jin, and Jing Li. 2004. An ontology-based publish/subscribe system. In Proceedings of the 5th ACM/IFIP/USENIX International Conference on Middleware (Middleware'04). Springer-Verlag, Berlin Heidelber, 232--253. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Segev Wasserkrug, Avigdor Gal, and Opher Etzion. 2006. A taxonomy and representation of sources of uncertainty in active systems. In Proceedings of the 6th International Conference on Next Generation Information Technologies and Systems (NGITS). Opher Etzion, Tsvi Kuflik, and Amihai Motro (Eds.), Lecture Notes in Computer Science, vol. 4032, Springer, Berlin, 174--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Segev Wasserkrug, Avigdor Gal, Opher Etzion, and Yulia Turchin. 2008. Complex event processing over uncertain data. In Proceedings of the 2nd International Conference on Distributed Event-Based Systems (DEBS'08). ACM, New York, NY, 253--264. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Yahoo!. 2013. Yahoo! directory: Automotive - makes and models. http://dir.yahoo.com/recreation/automotive/makes_and_models/.Google ScholarGoogle Scholar
  36. Liangzhao Zeng and Hui Lei. 2004. A semantic publish/subscribe system. In Proceedings of the IEEE International Conference on E-Commerce Technology for Dynamic E-Business. 32--39. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Approximate Semantic Matching of Events for the Internet of Things

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Internet Technology
              ACM Transactions on Internet Technology  Volume 14, Issue 1
              Special Issue on Event Recognition
              July 2014
              161 pages
              ISSN:1533-5399
              EISSN:1557-6051
              DOI:10.1145/2659232
              • Editor:
              • Munindar P. Singh
              Issue’s Table of Contents

              Copyright © 2014 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 7 August 2014
              • Accepted: 1 April 2014
              • Revised: 1 March 2014
              • Received: 1 October 2013
              Published in toit Volume 14, Issue 1

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader