Abstract
Event processing follows a decoupled model of interaction in space, time, and synchronization. However, another dimension of semantic coupling also exists and poses a challenge to the scalability of event processing systems in highly semantically heterogeneous and dynamic environments such as the Internet of Things (IoT). Current state-of-the-art approaches of content-based and concept-based event systems require a significant agreement between event producers and consumers on event schema or an external conceptual model of event semantics. Thus, they do not address the semantic coupling issue. This article proposes an approach where participants only agree on a distributional statistical model of semantics represented in a corpus of text to derive semantic similarity and relatedness. It also proposes an approximate model for relaxing the semantic coupling dimension via an approximation-enabled rule language and an approximate event matcher. The model is formalized as an ensemble of semantic and top-k matchers along with a probability model for uncertainty management. The model has been empirically validated on large sets of events and subscriptions synthesized from real-world smart city and energy management systems. Experiments show that the proposed model achieves more than 95% F1Score of effectiveness and thousands of events/sec of throughput for medium degrees of approximation while not requiring users to have complete prior knowledge of event semantics. In semantically loosely-coupled environments, one approximate subscription can compensate for hundreds of exact subscriptions to cover all possibilities in environments which require complete prior knowledge of event semantics. Results indicate that approximate semantic event processing could play a promising role in the IoT middleware layer.
- Bogdan Alexe, Wang-Chiew Tan, and Yannis Velegrakis. 2008. STBenchmark: Towards a benchmark for mapping systems. Proc. VLDB Endow. 1, 1 (2008), 230--244. Google ScholarDigital Library
- Kyle Anderson, Adrian Ocneanu, Diego Benitez, Derrick Carlson, Anthony Rowe, and Mario Berges. 2012. BLUED: A fully labeled public dataset for event-based non-intrusive load monitoring research. In Proceedings of the 2nd KDD Workshop on Data Mining Applications in Sustainability (SustKDD).Google Scholar
- Luigi Atzori, Antonio Iera, and Giacomo Morabito. 2010. The Internet of Things: A survey. Comput. Netw. 54, 15 (2010), 2787--2805. Google ScholarDigital Library
- Zohra Bellahsene, Angela Bonifati, Fabien Duchateau, and Yannis Velegrakis. 2011. On evaluating schema matching and mapping. In Schema Matching and Mapping, Springer, Bertin, 253--291.Google Scholar
- Antonio Carzaniga, David S. Rosenblum, and Alexander L. Wolf. 2000. Achieving scalability and expressiveness in an Internet-scale event notification service. In Proceedings of the 19th Annual ACM Symposium on Principles of Distributed Computing. ACM, 219--227. Google ScholarDigital Library
- Gianpaolo Cugola and Alessandro Margara. 2012. Processing flows of information: From data stream to complex event processing. ACM Comput. Surv. 44, 3, Article 15 (2012). Google ScholarDigital Library
- Edward Curry, Souleiman Hasan, and Sean O'Riain. 2012. Enterprise energy management using a linked dataspace for energy intelligence. In Proceedings of Sustainable Internet and ICT for Sustainability (SustainIT). IEEE, 1--6.Google Scholar
- Richard Cyganiak. 2013. Rooms in the DERI building. http://lab.linkeddata.deri.ie/2010/deri-rooms.Google Scholar
- Hong-Hai Do, Sergey Melnik, and Erhard Rahm. 2003. Comparison of schema matching evaluations. In Proceedings of the Revised Papers from the NODe Web and Database-Related Workshop on Web, Web-Services, and Database Systems. Akmal B. Chaudhri, Mario Jeckle, Erhard Rahm, and Rainer Unland (Eds.), Lecture Notes in Computer Science, vol. 2593, Springer, Berlin Heidelberg, 221--237. Google ScholarDigital Library
- EsperTech. 2013. Esper complex event processing engine. http://esper.codehaus.org/.Google Scholar
- Patrick Th. Eugster, Pascal A. Felber, Rachid Guerraoui, and Anne-Marie Kermarrec. 2003. The many faces of publish/subscribe. ACM Comput. Surv. 35, 2 (2003), 114--131. Google ScholarDigital Library
- Françoise Fabret, François Llirbat, Joao Pereira, I. Rocquencourt, and Dennis Shasha. 2000. Efficient matching for content-based publish/subscribe systems. In Proceedings of the International Conference on Cooperative Information Systems (CoopIS). Google ScholarDigital Library
- Evgeniy Gabrilovich and Shaul Markovitch. 2007. Computing semantic relatedness using Wikipedia-based explicit semantic analysis. In Proceedings of the 20th International Joint Conference on Artifical Intelligence (IJCAI'07). Morgan Kaufmann Publishers Inc., San Francisco, CA, 1606--1611. Google ScholarDigital Library
- Avigdor Gal. 2006. Managing uncertainty in schema matching with top-K schema mappings. In Journal on Data Semantics VI, Stefano Spaccapietra, Karl Aberer, and Philippe Cudr-Mauroux (Eds.), Lecture Notes in Computer Science, vol. 4090, Springer, Berlin Heidelberg, 90--114. Google ScholarDigital Library
- Avigdor Gal. 2011. Uncertain schema matching. Synthesis Lect. Data Manage. 3, 1 (2011), 1--97.Google ScholarCross Ref
- Zellig S. Harris. 1954. Distributional structure. Word 10 (1954), 146--162.Google ScholarCross Ref
- Souleiman Hasan, Sean O'Riain, and Edward Curry. 2012. Approximate semantic matching of heterogeneous events. In Proceedings of the 6th ACM International Conference on Distributed Event-Based Systems (DEBS'12). 252--263. Google ScholarDigital Library
- Souleiman Hasan, Sean O'Riain, and Edward Curry. 2013. Towards unified and native enrichment in event processing systems. In Proceedings of the 7th ACM International Conference on Distributed Event-Based Systems (DEBS'13). 171--182. Google ScholarDigital Library
- Jing He, Yanchun Zhang, Guangyan Huang, and Jinli Cao. 2012. A smart Web service based on the context of things. ACM Trans. Internet Technol. 11, 3, Article 13 (2012). Google ScholarDigital Library
- Gregor Hohpe and Bobby Woolf. 2004. Enterprise Integration Patterns: Designing, Building, and Deploying Messaging Solutions. Addison-Wesley Professional. Google ScholarDigital Library
- Katja Hose and Akrivi Vlachou. 2012. A survey of skyline processing in highly distributed environments. VLDB 21, 3 (2012), 359--384. Google ScholarDigital Library
- Danh Le-Phuoc, Minh Dao-Tran, Josiane Xavier Parreira, and Manfred Hauswirth. 2011. A native and adaptive approach for unified processing of linked streams and linked data. In Proceedings of the 10th International Semantic Web Conference (ISWC). Lecture Notes in Computer Science, vol. 7031, Springer, Berlin Heidelberg, 370--388. Google ScholarDigital Library
- Yoonkyong Lee, Mayssam Sayyadian, AnHai Doan, and Arnon S. Rosenthal. 2007. eTuner: Tuning schema matching software using synthetic scenarios. VLDB 16, 1 (2007), 97--122. Google ScholarDigital Library
- Haifeng Liu and H.-Arno Jacobsen. 2002. A-TOPSS: A publish/subscribe system supporting approximate matching. In Proceedings of the 28th International Conference on Very Large Data Bases (VLDB'02). VLDB Endowment, 1107--1110. Google ScholarDigital Library
- Zhen Liu, Srinivasan Parthasarathy, Anand Ranganathan, and Hao Yang. 2008. Near-optimal algorithms for shared filter evaluation in data stream systems. In Proceedings of the ACM SIGMOD International Conference on Management of Data. ACM, 133--146. Google ScholarDigital Library
- Merriam-Webster. 2012. Merriam-Webster's Collegiate Thesaurus. http://www.dictionaryapi.com/products/api-collegiate-thesaurus.htm.Google Scholar
- George A. Miller. 1995. WordNet: A lexical database for English. Commun. ACM 38, 11 (1995), 39--41. Google ScholarDigital Library
- Gero Mühl, Ludger Fiege, and Peter Pietzuch. 2006. Distributed Event-Based Systems. Vol. 1. Springer.Google Scholar
- OECD. 2012. Machine-to-machine communications: Connecting billions of devices. http://www.oecd-ilibrary.org.Google Scholar
- Milenko Petrovic, Ioana Burcea, and Hans-Arno Jacobsen. 2003. S-ToPSS: Semantic Toronto publish/subscribe system. In Proceedings of the 29th International Conference on Very Large Data Bases (VLDB'03). VLDB Endowment, 1101--1104. Google ScholarDigital Library
- Luis Sanchez, José Antonio Galache, Veronica Gutierrez, J. M. Hernandez, J. Bernat, Alex Gluhak, and Tomás Garcia. 2011. SmartSantander: The meeting point between future Internet research and experimentation and the smart cities. In Proceedings of the Future Network & Mobile Summit (FutureNetw). IEEE, 1--8.Google Scholar
- Jinling Wang, Beihong Jin, and Jing Li. 2004. An ontology-based publish/subscribe system. In Proceedings of the 5th ACM/IFIP/USENIX International Conference on Middleware (Middleware'04). Springer-Verlag, Berlin Heidelber, 232--253. Google ScholarDigital Library
- Segev Wasserkrug, Avigdor Gal, and Opher Etzion. 2006. A taxonomy and representation of sources of uncertainty in active systems. In Proceedings of the 6th International Conference on Next Generation Information Technologies and Systems (NGITS). Opher Etzion, Tsvi Kuflik, and Amihai Motro (Eds.), Lecture Notes in Computer Science, vol. 4032, Springer, Berlin, 174--185. Google ScholarDigital Library
- Segev Wasserkrug, Avigdor Gal, Opher Etzion, and Yulia Turchin. 2008. Complex event processing over uncertain data. In Proceedings of the 2nd International Conference on Distributed Event-Based Systems (DEBS'08). ACM, New York, NY, 253--264. Google ScholarDigital Library
- Yahoo!. 2013. Yahoo! directory: Automotive - makes and models. http://dir.yahoo.com/recreation/automotive/makes_and_models/.Google Scholar
- Liangzhao Zeng and Hui Lei. 2004. A semantic publish/subscribe system. In Proceedings of the IEEE International Conference on E-Commerce Technology for Dynamic E-Business. 32--39. Google ScholarDigital Library
Index Terms
- Approximate Semantic Matching of Events for the Internet of Things
Recommendations
Thematic event processing
Middleware '14: Proceedings of the 15th International Middleware ConferenceEvent-based systems follow a decoupled mode of interaction between event producers and consumers in space, time, and synchronization to enable scalability within distributed systems. We recognize a fourth dimension of coupling due to the need for mutual ...
Approximate semantic matching of heterogeneous events
DEBS '12: Proceedings of the 6th ACM International Conference on Distributed Event-Based SystemsEvent-based systems have loose coupling within space, time and synchronization, providing a scalable infrastructure for information exchange and distributed workflows. However, event-based systems are tightly coupled, via event subscriptions and ...
Using Semantic Web for Internet of Things Interoperability: A Systematic Review
The main vision of the Internet of Things IoT is to enable seamless connection between physical devices and information systems to improve the lives of people. One of the main obstacles to achieve this vision is the current lack of IoT interoperability. ...
Comments