skip to main content
research-article

On Folded-Clos Networks with Deterministic Single-Path Routing

Published:29 January 2016Publication History
Skip Abstract Section

Abstract

Folded-Clos networks, also known as fat-trees, have been widely used as interconnects in large-scale high-performance computing clusters. Although users often treat such interconnects as replacements of nonblocking crossbar switches that can carry out any permutation communication without contention, the networking capability of such interconnects without a centralized controller in computer communication environments is not well understood. In this article, we investigate nonblocking two-level folded-Clos networks with deterministic single-path routing, but no centralized controller, and establish the nonblocking condition. The results indicate that nonblocking two-level folded-Clos networks without a centralized controller are much more expensive to construct than the traditional nonblocking networks in the telecommunication environment. Practical two-level folded-Clos based interconnects are blocking. For such interconnects, we establish the lower bound for worst-case contention for permutations with any deterministic single-path routing scheme, show that existing routing schemes perform poorly in terms of worst-case contention for permutations, present a routing scheme that achieves the theoretical optimal, and empirically compare the performance of existing schemes with the optimal routing scheme. The techniques developed for two-level folded-Clos networks are further extended for the general fat-trees of any heights.

References

  1. Y. Aydogan, C. B. Stunkel, C. Aykanat, and B. Abali. 1996. Adaptive source routing in multistage interconnection networks. In Proceedings of the 10th International Parallel Processing Symposium. 258--267. DOI:http://dx.doi.org/10.1109/IPPS.1996.508067 Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. V. E. Benes. 1962. On rearrangeable three-stage connecting networks. Bell System Technical Journal 41, 5, 1481--1492.Google ScholarGoogle ScholarCross RefCross Ref
  3. V. E. Benes. 1965. Mathematical Theory of Connecting Networks and Telephone Traffic. Vol. 17. Academic Press.Google ScholarGoogle Scholar
  4. Jehoshua Bruck, Ching-Tien Ho, Eli Upfal, Shlomo Kipnis, and Derrick Weathersby. 1997. Efficient algorithms for all-to-all communications in multiport message-passing systems. IEEE Transactions on Parallel and Distributed Systems 8, 11, 1143--1156. DOI:http://dx.doi.org/10.1109/71.642949 Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Charles Clos. 1953. A study of non-blocking switching networks. Bell System Technical Journal 32, 2, 406--424. DOI:http://dx.doi.org/10.1002/j.1538-7305.1953.tb01433.xGoogle ScholarGoogle ScholarCross RefCross Ref
  6. J. Flich, M. P. Malumbres, P. Lopez, and J. Duato. 2000. Improving routing performance in Myrinet networks. In Proceedings of the 14th International Parallel and Distributed Processing Symposium (IPDPS'00). 27--32. DOI:http://dx.doi.org/10.1109/IPDPS.2000.845961 Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. P. Geoffray and T. Hoefler. 2008. Adaptive routing strategies for modern high performance networks. In Proceedings of the 16th IEEE Symposium on High Performance Interconnects. 165--172. DOI:http://dx.doi. org/10.1109/HOTI.2008.21 Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Gomez, F. Gilabert, M. E. Gomez, P. Lopez, and J. Duato. 2007. Deterministic versus adaptive routing in fat-trees. In Proceedings of the 21st IEEE International Parallel and Distributed Processing Symposium. 1--8. DOI:http://dx.doi.org/10.1109/IPDPS.2007.370482Google ScholarGoogle Scholar
  9. Ronald I. Greenberg and C. E. Leiserson. 1985. Randomized routing on fat-tress. In Proceedings of the 26th Annual Symposium on Foundations of Computer Science. 241--249. DOI:http://dx.doi.org/10.1109/ SFCS.1985.46 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. T. Hoefler, T. Schneider, and A. Lumsdaine. 2008. Multistage switches are not crossbars: Effects of static routing in high-performance networks. In Proceedings of the 2008 IEEE International Conference on Cluster Computing. 116--125. DOI:http://dx.doi.org/10.1109/CLUSTR.2008.4663762Google ScholarGoogle ScholarCross RefCross Ref
  11. InfiniBandTM Trade Association. 2004. InfiniBandTM Architecture Specification. Release 1.2.Google ScholarGoogle Scholar
  12. G. Johnson, D. K. Kerbyson, and M. Lang. 2008. Optimization of InfiniBand for scientific applications. In Proceedings of the 2008 IEEE International Symposium on Parallel and Distributed Processing (IPDPS'08). 1--8. DOI:http://dx.doi.org/10.1109/IPDPS.2008.4536345Google ScholarGoogle ScholarCross RefCross Ref
  13. Ajai Kapoor and Romeo Rizzi. 2000. Edge-coloring bipartite graphs. Journal of Algorithms 34, 2, 390--396. DOI:http://dx.doi.org/10.1006/jagm.1999.1058 Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. John Kim, William J. Dally, and Dennis Abts. 2006. Adaptive routing in high-radix Clos network. In Proceedings of the ACM/IEEE SC 2006 Conference. 7--7. DOI:http://dx.doi.org/10.1109/SC.2006.10 Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. E. Leiserson. 1985. Fat-trees: Universal networks for hardware-efficient supercomputing. IEEE Transactions on Computing C-34, 10, 892--901. DOI:http://dx.doi.org/10.1109/TC.1985.6312192 Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Xuan-Yi Lin, Yeh-Ching Chung, and Tai-Yi Huang. 2004. A multiple LID routing scheme for fat-tree-based InfiniBand networks. In Proceedings of the 18th International Parallel and Distributed Processing Symposium. 11. DOI:http://dx.doi.org/10.1109/IPDPS.2004.1302913Google ScholarGoogle Scholar
  17. P. Lopez, J. Flich, and J. Duato. 2001. Deadlock-free routing in InfiniBandTM through destination renaming. In Proceedings of the 2001 International Conference on Parallel Processing. 427--434. DOI:http://dx.doi.org/10.1109/ICPP.2001.952089 Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Wickus Nienaber, Xin Yuan, and Zhenhai Duan. 2009. LID assignment in InfiniBand networks. IEEE Transactions on Parallel and Distributed Systems 20, 4, 484--497. DOI:http://dx.doi.org/10.1109/TPDS. 2008.144 Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. S. R. Ohring, M. Ibel, S. K. Das, and M. J. Kumar. 1995. On generalized fat trees. In Proceedings of the 9th International Parallel Processing Symposium. 37--44. DOI:http://dx.doi.org/10.1109/IPPS.1995.395911 Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. F. Petrini and M. Vanneschi. 1997. K-ary N-trees: High performance networks for massively parallel architectures. In Proceedings of the 11th International Parallel Processing Symposium. 87--93. DOI:http://dx.doi.org/10.1109/IPPS.1997.580853 Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. G. Rodriguez, C. Minkenberg, R. Beivide, R. P. Luijten, J. Labarta, and M. Valero. 2009. Oblivious routing schemes in extended generalized fat tree networks. In Proceedings of the 2009 IEEE International Conference on Cluster Computing and Workshops. 1--8. DOI:http://dx.doi.org/10.1109/CLUSTR.2009.5289145Google ScholarGoogle ScholarCross RefCross Ref
  22. T. Schneider, T. Hoefler, and A. Lumsdaine. 2009. ORCS: An Oblivious Routing Congestion Simulator. TR-675. Indiana University Computer Science.Google ScholarGoogle Scholar
  23. A. Singh. 2005. Load-Balanced Routing in Interconnection Networks. Ph.D. Dissertation. Stanford University, Stanford, CA.Google ScholarGoogle Scholar
  24. Yuanyuan Yang and Jianchao Wang. 1999. Wide-sense nonblocking Clos networks under packing strategy. IEEE Transactions on Computers 48, 3, 265--284. DOI:http://dx.doi.org/10.1109/12.754994 Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Xin Yuan. 2011. On nonblocking folded-Clos networks in computer communication environments. In Proceedings of the 2011 IEEE International Parallel Distributed Processing Symposium (IPDPS'11). 188--196. DOI:http://dx.doi.org/10.1109/IPDPS.2011.27 Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. X. Yuan, W. Nienaber, Z. Duan, and R. Melhem. 2009. Oblivious routing in fat-tree based system area networks with uncertain traffic demands. IEEE/ACM Transactions on Networking 17, 5, 1439--1452. DOI:http://dx.doi.org/10.1109/TNET.2009.2012853 Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Eitan Zahavi, Greg Johnson, Darren J. Kerbyson, and Michael Lang. 2010. Optimized InfiniBand fat-tree routing for shift all-to-all communication patterns. Concurrency and Computation: Practice and Experience 22, 2, 217--231. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Eitan Zahavi. 2012. Fat-tree routing and node ordering providing contention free traffic for MPI global collectives. Journal of Parallel and Distributed Computing 72, 11, 1423--1432. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. On Folded-Clos Networks with Deterministic Single-Path Routing

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      • Published in

        cover image ACM Transactions on Parallel Computing
        ACM Transactions on Parallel Computing  Volume 2, Issue 4
        Special Issue on PPOPP 2014
        March 2016
        202 pages
        ISSN:2329-4949
        EISSN:2329-4957
        DOI:10.1145/2888415
        Issue’s Table of Contents

        Copyright © 2016 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 29 January 2016
        • Accepted: 1 October 2015
        • Revised: 1 November 2014
        • Received: 1 September 2013
        Published in topc Volume 2, Issue 4

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article
        • Research
        • Refereed

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader