ABSTRACT
We investigate the problem of designing a scalable overlay network to support decentralized topic-based pub/sub communication. We introduce a new optimization problem, called Minimum Topic-Connected Overlay (Min-TCO), that captures the tradeoff between the scalability of the overlay (in terms of the nodes' fanout) and the message forwarding overhead incurred by the communicating parties. Roughly, the Min-TCO problem is as follows: Given a collection of nodes and their subscriptions, connect the nodes using the minimum possible number of edges so that for each topic t, a message published on t could reach all the nodes interested in t by being forwarded by onlythe nodes interested in t.
We show that the decision version of Min-TCO is NP-complete, and present a polynomial algorithm that approximates the optimal solution within a logarithmic factor with respect to the number of edges in theconstructed overlay. We further prove that this approximation ratio is almost tight by showing that no polynomial algorithm can approximate Min-TCO within a constant factor (unless P=NP). We show experimentally that on typical inputs, the fanout of the overlay constructed by our approximation algorithm is significantly lower thanthat of the overlays built by the existing algorithms, and that its running time is just a small fraction of the analytical worst case bound. As Min-TCO can be shown to capture several important aspects of most known overlay-based pub/sub implementations, our study sheds light on the inherent limitations of the existing systems as well asprovides an insight into the best possible feasible solution.
Finally, we introduce a flexible framework that generalizes Min-TCO and formalizes most similar overlay design problems that occur in scalable pub/sub systems. We also briefly discuss several examples of such problems, and show some results with respect to their complexity.
- Oracle9i Application Developers GuideAdvanced Queuing. Oracle, Redwood Shores, CA.Google Scholar
- E. Anceaume, M. Gradinariu, A. K. Datta, G. Simon, and A. Virgillito. A semantic overlay for self-* peer-to-peer publish/subscribe. In ICDCS, 2006. Google ScholarDigital Library
- S. Baehni, P. T. Eugster, and E. Guerraoui. Data-aware multicast. In DSN, 2004. Google ScholarDigital Library
- R. Baldoni, R. Beraldi, V. Quema, L. Querzoni, and S. T. Piergiovanni. TERA: Topic-based Event Routing for Peer-to-Peer Architectures. In 1th International Conference on Distributed Event-Based Systems (DEBS). ACM, 6 2007. Google ScholarDigital Library
- R. Baldoni, R. Beraldi, L. Querzoni, and A. Virgillito. Efficient publish/subscribe through a self-organizing broker overlay and its application to SIENA. The Computer Journal, 2007. Google ScholarDigital Library
- S. Banerjee, B. Bhattacharjee, and C. Kommareddy. Scalable application layer multicast. SIGCOMM Comput. Commun. Rev., 32(4):205--217, 2002. Google ScholarDigital Library
- S. Bhola, R. Strom, S. Bagchi, Y. Zhao, and J. Auerbach. Exactly-once delivery in a content-based publish-subscribe system. In DSN, 2002. Google ScholarDigital Library
- A. Carzaniga, M. J. Rutherford, and A. L. Wolf. A routing scheme for content-based networking. In Proceedings of IEEE INFOCOM 2004, Hong Kong, China, Mar. 2004.Google ScholarCross Ref
- M. Castro, P. Druschel, A.-M. Kermarrec, and A. Rowstron. SCRIBE: a large-scale and decentralized application-level multicast infrastructure. IEEE J. Selected Areas in Comm. (JSAC), 20(8):1489--1499, 2002. Google ScholarDigital Library
- R. Chand and P. Felber. Semantic peer-to-peer overlays for publish/subscribe networks. In Euro-Par 2005 Parallel Processing, Lecture Notes in Computer Science, volume 3648, pages 1194--1204. Springer Verlag, 2005. Google ScholarDigital Library
- G. Chockler, R. Melamed, Y. Tock, and R. Vitenberg. Constructing scalable overlay networks for pub/sub with many topics. Manuscript in progress, 2007. Google ScholarDigital Library
- G. Chockler, R. Melamed, Y. Tock, and R. Vitenberg. SpiderCast: A Scalable Interest-Aware Overlay for Topic-Based Pub/Sub Communication. In 1th International Conference on Distributed Event-Based Systems (DEBS). ACM, 6 2007. Google ScholarDigital Library
- P. T. Eugster, P. A. Felber, R. Guerraoui, and A.-M. Kermarrec. The many faces of publish/subscribe. ACM Computing Surveys, 35(2):114--131, 2003. Google ScholarDigital Library
- R. Guerraoui, S. Handurukande, and A.-M. Kermarrec. Gosskip: a gossip-based structured overlay network for efficient content-based filtering. Technical Report IC/2004/95, EPFL, Lausanne, 2004.Google Scholar
- R. Lewis. Advanced Messaging Applications with MSMQ and MQSeries. QUE, 1999.Google Scholar
- H. Liu, V. Ramasubramanian, and E. G. Sirer. Client behavior and feed characteristics of rss, a publish-subscribe system for web micronews. In Internet Measurement Conference (IMC), Berkeley, California, October 2005. Google ScholarDigital Library
- V. Ramasubramanian, R. Peterson, and E. G. Sirer. Corona: A high performance publish-subscribe system for the world wide web. In NSDI, 2006. Google ScholarDigital Library
- D. Sandler, A. Mislove, A. Post, and P. Druschel. Feedtree: Sharing web micronews with peer-to-peer event notification. In International Workshop on Peer-to-Peer Systems (IPTPS), 2005. Google ScholarDigital Library
- D. Tam, R. Azimi, and H.-A. Jacobsen. Building content-based publish /subscribe systems with distributed hash tables. In 1st Intl. Workshop on Databases, Information Systems, and P2P Computing (DBISP2P), Berlin, Germany, 2003.Google Scholar
- Y. Tock, N. Naaman, A. Harpaz, and G. Gershinsky. Hierarchical clustering of message flows in a multicast data dissemination system. In 17th IASTED Int'l Conf. Parallel and Distributed Computing and Systems, pages 320--327, 2005.Google Scholar
- S. Voulgaris, E. Riviere, A.-M. Kermarrec, and M. van Steen. Sub-2-sub: Self-organizing content-based publish subscribe for dynamic large scale collaborative networks. In IPTPS, 2006.Google Scholar
- S. Zhuang, B. Zhao, A. Joseph, R. Katz, and J. Kubiatowicz. Bayeux: An architecture for scalable and fault tolerant wide-area data dissemination. In 11th International Workshop Network and Operating System Support for Digital Audio and Video (NOSSDAV), pages 11--20, June 2001. Google ScholarDigital Library
Index Terms
- Constructing scalable overlays for pub-sub with many topics
Recommendations
Araneola: A scalable reliable multicast system for dynamic environments
This paper presents Araneola (Araneola means ''little spider'' in Latin.), a scalable reliable application-level multicast system for highly dynamic wide-area environments. Araneola supports multi-point to multi-point reliable communication in a fully ...
SplitStream: high-bandwidth multicast in cooperative environments
SOSP '03: Proceedings of the nineteenth ACM symposium on Operating systems principlesIn tree-based multicast systems, a relatively small number of interior nodes carry the load of forwarding multicast messages. This works well when the interior nodes are highly-available, dedicated infrastructure routers but it poses a problem for ...
Indexing through Querying in Unstructured Peer-to-Peer Overlay Networks
APNOMS '08: Proceedings of the 11th Asia-Pacific Symposium on Network Operations and Management: Challenges for Next Generation Network Operations and Service ManagementThe efficiency of a Peer-to-Peer file sharing overlay is measured in terms of the scalability and versatility of its object lookup strategy. In these networks peers carry out distributed query relaying to discover the service providers. Existing lookup ...
Comments