Abstract
To be agile and cost effective, data centers should allow dynamic resource allocation across large server pools. In particular, the data center network should enable any server to be assigned to any service. To meet these goals, we present VL2, a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics. VL2 uses (1) flat addressing to allow service instances to be placed anywhere in the network, (2) Valiant Load Balancing to spread traffic uniformly across network paths, and (3) end-system based address resolution to scale to large server pools, without introducing complexity to the network control plane. VL2's design is driven by detailed measurements of traffic and fault data from a large operational cloud service provider. VL2's implementation leverages proven network technologies, already available at low cost in high-speed hardware implementations, to build a scalable and reliable network architecture. As a result, VL2 networks can be deployed today, and we have built a working prototype. We evaluate the merits of the VL2 design using measurement, analysis, and experiments. Our VL2 prototype shuffles 2.7 TB of data among 75 servers in 395 seconds - sustaining a rate that is 94% of the maximum possible.
- M. Al-Fares, A. Loukissas, and A. Vahdat. A scalable, commodity data center network architecture. In SIGCOMM, 2008. Google ScholarDigital Library
- C. Chang, D. Lee, and Y. Jou. Load balanced Birkhoff-von Neumann switches, part I: one-stage buffering. IEEE HPSR, 2001.Google Scholar
- Cisco. Data center Ethernet. http://www.cisco.com/go/dce.Google Scholar
- Cisco: Data center: Load balancing data center services, 2004.Google Scholar
- K. C. Claffy, H. werner Braun, and G. C. Polyzos. A parameterizable methodology for Internet traffic flow profiling. JSAC, 13, 1995. Google ScholarDigital Library
- W. J. Dally and B. Towles. Principles and Practices of Interconnection Networks. Morgan Kaufmann Publishers, 2004. Google ScholarDigital Library
- N. G. Duffield, P. Goyal, A. G. Greenberg, P. P. Mishra, K. K. Ramakrishnan, and J. E. van der Merwe. A flexible model for resource management in virtual private network. In SIGCOMM, 1999. Google ScholarDigital Library
- D. Farinacci, V. Fuller, D. Oran, D. Meyer, and S. Brim. Locator/ID Separation Protocol (LISP). Internet-draft, Dec. 2008.Google Scholar
- A. Greenberg, P. Lahiri, D. A. Maltz, P. Patel, and S. Sengupta. Towards a next generation data center architecture: Scalability and commoditization. In PRESTO Workshop at SIGCOMM, 2008. Google ScholarDigital Library
- C. Guo, H. Wu, K. Tan, L. Shiy, Y. Zhang, and S. Lu. Dcell: A scalable and fault-tolerant network structure for data centers. In SIGCOMM, 2008. Google ScholarDigital Library
- C. Guo, H. Wu, K. Tan, L. Shiy, Y. Zhang, and S. Lu. Bcube: A high performance, server-centric network architecture for modular data centers. In SIGCOMM, 2009. Google ScholarDigital Library
- J. Hamilton. Cems: Low-cost, low-power servers for internet-scale services. In Conf. on Innovative Data Systems Research, Jan 2009.Google Scholar
- M. Handley, S. Floyd, J. Padhye, and J. Widmer. TCP friendly rate control (TFRC): Protocol specification. RFC 3348, 2003. Google ScholarDigital Library
- R. Jain. The Art of Computer Systems Performance Analysis. John Wiley and Sons, Inc., 1991.Google Scholar
- S. Kandula, D. Katabi, B. Davie, and A. Charny. Walking the Tightrope: Responsive yet Stable Traffic Engineering. In SIGCOMM, 2005. Google ScholarDigital Library
- C. Kim, M. Caesar, and J. Rexford. Floodless in SEATTLE: a scalable ethernet architecture for large enterprises. In SIGCOMM, 2008. Google ScholarDigital Library
- M. Kodialam, T. V. Lakshman, and S. Sengupta. Efficient and Robust Routing of Highly Variable Traffic. In HotNets, 2004.Google Scholar
- L. Lamport. The part-time parliament. ACM Transactions on Computer Systems, 16:133--169, 1998. Google ScholarDigital Library
- M. Mathis, J. Heffner, and R. Raghunarayan. TCP extended statistics MIB. RFC 4898, 2007.Google Scholar
- S. Sinha, S. Kandula, and D. Katabi. Harnessing TCP's burstiness with flowlet switching. In HotNets, 2004.Google Scholar
- Y. Zhang and Z. Ge. Finding critical traffic matrices. In DSN, June 2005.Google Scholar
- R. Zhang-Shen and N. McKeown. Designing a Predictable Internet Backbone Network. In HotNets, 2004.Google Scholar
Index Terms
- VL2: a scalable and flexible data center network
Recommendations
VL2: a scalable and flexible data center network
SIGCOMM '09: Proceedings of the ACM SIGCOMM 2009 conference on Data communicationTo be agile and cost effective, data centers should allow dynamic resource allocation across large server pools. In particular, the data center network should enable any server to be assigned to any service. To meet these goals, we present VL2, a ...
Data center TCP (DCTCP)
SIGCOMM '10Cloud data centers host diverse applications, mixing workloads that require small predictable latency with others requiring large sustained throughput. In this environment, today's state-of-the-art TCP protocol falls short. We present measurements of a ...
Data center TCP (DCTCP)
SIGCOMM '10: Proceedings of the ACM SIGCOMM 2010 conferenceCloud data centers host diverse applications, mixing workloads that require small predictable latency with others requiring large sustained throughput. In this environment, today's state-of-the-art TCP protocol falls short. We present measurements of a ...
Comments