ABSTRACT
The ever-increasing scale of scientific data has become a significant challenge for researchers that rely on networks to interact with remote computing systems and transfer results to collaborators worldwide. Despite the availability of high-capacity connections, scientists struggle with inadequate cyberinfrastructure that cripples data transfer performance, and impedes scientific progress. The Science DMZ paradigm comprises a proven set of network design patterns that collectively address these problems for scientists. We explain the Science DMZ model, including network architecture, system configuration, cybersecurity, and performance tools, that creates an optimized network environment for science. We describe use cases from universities, supercomputing centers and research laboratories, highlighting the effectiveness of the Science DMZ model in diverse operational settings. In all, the Science DMZ model is a solid platform that supports any science workflow, and flexibly accommodates emerging network technologies. As a result, the Science DMZ vastly improves collaboration, accelerating scientific discovery.
- C. Alexander. The Timeless Way of Building. Oxford University Press, New York, 1979.Google Scholar
- C. Alexander, S. Ishikawa, and M. Silverstein. A Pattern Language: Towns, Buildings, Construction. Oxford University Press, New York, August 1977.Google Scholar
- W. Allcock, J. Bresnahan, R. Kettimuthu, M. Link, C. Dumitrescu, I. Raicu, and I. Foster. The Globus Striped GridFTP Framework and Server. In Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, SC '05, page 54, Washington, DC, USA, 2005. IEEE Computer Society. Google ScholarDigital Library
- B. Allen, J. Bresnahan, L. Childers, I. Foster, G. Kandaswamy, R. Kettimuthu, J. Kordas, M. Link, S. Martin, K. Pickett, et al. Software as a service for data scientists. Communications of the ACM, 55(2):81--88, 2012. Google ScholarDigital Library
- I. T. Association. InfiniBand. Architecture Specification Release 1.2.1 Annex A16: RoCE, 2010.Google Scholar
- E. Gamma, R. Helm, R. Johnson, and J. Vlissides. Design patterns: elements of reusable object-oriented software. Addison-Wesley Longman Publishing Co., Inc., Boston, MA, USA, 1995. Google ScholarDigital Library
- C. Guok, D. Robertson, M. Thompson, J. Lee, B. Tierney, and W. Johnston. Intra and Interdomain Circuit Provisioning Using the OSCARS Reservation System. In Third International Conference on Broadband Communications Networks and Systems, IEEE/ICST, October 2006.Google Scholar
- V. Jacobson, R. Braden, and D. Borman. TCP Extensions for High Performance. RFC 1323 (Proposed Standard), May 1992. Google ScholarDigital Library
- W. E. Johnston, E. Dart, M. Ernst, and B. Tierney. Enabling high throughput in widely distributed data management and analysis systems: Lessons from the LHC. In TERENA Networking Conference (TNC) 2013, June 2013.Google Scholar
- E. Kissel, G. Fernandes, M. Jaffee, M. Swany, and M. Zhang. Driving software defined networks with xsp. In Workshop on Software Defined Networks (SDN'12), International Conference on Communications (ICC). IEEE, June 2012.Google ScholarCross Ref
- E. Kissel, B. Tierney, M. Swany, and E. Pouyoul. Efficient Data Transfer Protocols for Big Data. In Proceedings of the 8th International Conference on eScience. IEEE, July 2012. Google ScholarDigital Library
- M. Mathis, J. Semke, J. Mahdavi, and T. Ott. The macroscopic behavior of the tcp congestion avoidance algorithm. SIGCOMM Comput. Commun. Rev., 27(3):67--82, July 1997. Google ScholarDigital Library
- N. McKeown, T. Anderson, H. Balakrishnan, G. Parulkar, L. Peterson, J. Rexford, S. Shenker, and J. Turner. Openflow: enabling innovation in campus networks. SIGCOMM Comput. Commun. Rev., 38(2):69--74, Mar. 2008. Google ScholarDigital Library
- I. Monga, C. Guok, W. E. Johnston, and B. Tierney. Hybrid Networks: Lessons Learned and Future Challenges Based on ESnet4 Experience. In IEEE Communications Magazine, May 2011.Google ScholarCross Ref
- J. Postel. Transmission Control Protocol. Request for Comments (Standard) 793, Internet Engineering Task Force, September 1981.Google ScholarDigital Library
- B. Tierney, J. Boote, E. Boyd, A. Brown, M. Grigoriev, J. Metzger, M. Swany, M. Zekauskas, and J. Zurawski. perfSONAR: Instantiating a Global Network Measurement Framework. In SOSP Workshop on Real Overlays and Distributed Systems (ROADS '09), Big Sky, Montana, USA, Oct. 2009. ACM.Google Scholar
- J. Zurawski, R. Ball, A. Barczyk, M. Binkley, J. Boote, E. Boyd, A. Brown, R. Brown, T. Lehman, S. McKee, B. Meekhof, A. Mughal, H. Newman, S. Rozsa, P. Sheldon, A. Tackett, R. Voicu, S. Wolff, and X. Yang. The dynes instrument: A description and overview. Journal of Physics: Conference Series, 396(4):042065, 2012.Google ScholarCross Ref
Index Terms
- The Science DMZ: a network design pattern for data-intensive science
Recommendations
Practical Cyber-Security Solutions for the Science DMZ
PEARC '19: Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning)The Science DMZ provides high-bandwidth and high-throughput private connectivity to or among computing resources. However, the performance comes at a cost: there is minimal security on the Science DMZ. This is by design, as conventional enterprise ...
Major Factors in Science DMZ Deployment at Small Institutions
PEARC '18: Proceedings of the Practice and Experience on Advanced Research ComputingThe Science DMZ is a network research tool offering superior large science data transmission between two locations. Through a network design that places the Science DMZ at the edge of the campus network, the Science DMZ defines a network path that ...
The Science DMZ: A network design pattern for data-intensive science
SC13 --The International Conference for High Performance Computing, Networking, Storage and AnalysisThe ever-increasing scale of scientific data has become a significant challenge for researchers that rely on networks to interact with remote computing systems and transfer results to collaborators worldwide. Despite the availability of high-capacity ...
Comments