ABSTRACT
If you build it, will they come? Not necessarily. A critical need exists for knowledge in managing and properly utilizing supercomputing at mid-level and smaller research institutions. Simply having HPC hardware and some software is not enough. This paper relates the administrative experience of the first several months of a mid-level doctoral university providing a new enterprise XSEDE [15] Compatible Basic Cluster (XCBC) [3,4,5] high performance computing cluster to faculty and other researchers, including the experiences of first-day urgencies, initial problems in the first few weeks, and establishing an ongoing management system.
- Greg Bruno, Mason J. Katz, Frederico D. Sacerdoti, Philip M. Papadopoulos. Rolls: Modifying a Standard System Installer to Support User-Customizable Cluster Frontend Appliances. In IEEE International Conference on Cluster Computing. Washington, DC, September 2004. Google ScholarDigital Library
- Chui-hui Chiu, Nathan Lewis, Dipak Kumar Singh, Arghya Kusum Das, Mohammad M Jalazai, Richard Platania, Sayan Goswami, Kisung Lee, Seung-Jong Park. BIC-LSU: Big Data Research Integration with Cyberinfrastructure for LSU. XSEDE16. July, 2016. Miami. Google ScholarDigital Library
- Eric Coulter, Jeremy Fischer, Barbara Hallock, Richard Knepper, and Craig Steward. 2016. Implementation of Simple XSEDE-Like Clusters: Science Enabled and Lessons Learned. XSEDE16 (July 17-21, 2016), Miami. Google ScholarDigital Library
- Jeremy Fischer, Richard Knepper, Matthew Standish, Craig A. Stewart, Barbara Hallock, Resa Alvord, Victor Hazlewood, David Lifka. Methods for Creating XSEDE Compatible Clusters. XSEDE14, July, 2014. Atlanta, GA. Google ScholarDigital Library
- Jeremy Fischer, Richard Knepper, Eric Coulter, Charles Peck, and Craig A. Stewart. XCBC and XNIT -- Tools for Cluster Implementation and Management in Research and Training. In Proceedings of the 2015 IEEE International Conference on Cluster Computing. September 8-11, 2015, pp. 857--864. Google ScholarDigital Library
- Ian Foster, Rajkumar Kettimuthu, Stuart Martin, Steve Tuecke, Thomas Hauser, Daniel Milroy, Brock Palen, and Jazcek Braden, Campus Bridging Made Easy via Globus Services, XSEDE 2012, Chicago, IL, 2012. Google ScholarDigital Library
- M. J. Katz, P. M. Papadopoulos, and G. Bruno. Leveraging Standard Core Technologies to Programmatically Build Linux Cluster Appliances. In Proceedings of 2002 IEEE International Conference on Cluster Computing, Chicago, IL, October 2002. Google ScholarDigital Library
- Rajkumar Kettimuthu, Lukasz Lacinski, Mike Link, Karl Pickett, Steve Tuecke, and Ian Foster, Instant GridFTP, 9th Workshop on High Performance Grid and Cloud Computing, 2012. Google ScholarDigital Library
- Tom Madden. The BLAST Sequence Analysis Tool. 2013.Google Scholar
- Matt Massie, Bernard Li, Brad Nicholes, Vladimir Vuksan, Robert Alexander, Jeff Buchbinder, Frederiko Costa, Alex Dean, Dave Josephsen, Peter Phaal, Daniel Pocock. Monitoring with Ganglia. 2012. O'Reilly Media, Inc. Google ScholarDigital Library
- P. M. Papadopoulos, M. J. Katz, and G. Bruno. NPACI Rocks: Tools and Techniques for Easily Deploying Manageable Linux Clusters. In Proceedings of 2001 IEEE International Conference on Cluster Computing, Newport, CA, October 2001. Google ScholarDigital Library
- Philip M. Papadopoulos, Mason J. Katz, and Greg Bruno. NPACI Rocks: Tools and Techniques for Easily Deploying Manageable Linux Clusters. In Concurrency, Practice and Experience, 15(7-8):707--725, 2003.Google ScholarCross Ref
- Semir Sarajlic, Neranjan Edirisinghe, Yuriy Lukinov, Michael Walters, Brock Davis, and Gregori Faroux. Orion: Discovery Environment for HPC Research and Bridging XSEDE Resources. XSEDE16, July, 2016. Miami, FL. Google ScholarDigital Library
- Craig A. Stewart, Richard Knepper, James Ferguson, Felix Bachmann, Victor Hazlewood, Ian Foster, Andrew Grimshaw, and David Lifka. What is Campus Bridging and What is XSEDE Doing About It? XSEDE12, July, 2012. Chicago, IL. Google ScholarDigital Library
- John Towns, Timothy Cockerill, Maytal Dahan, Ian Foster, Kelly Gaither, Andrew Grimshaw, Victor Hazlewood, Scott Lathrop, Dave Lifka, Gregory D. Peterson, Ralph Roskies, J. Ray Scott, Nancy Wilkins-Diehr, "XSEDE: Accelerating Scientific Discovery", Computing in Science & Engineering, vol.16, no. 5, pp. 62--74, Sept.-Oct. 2014.Google Scholar
Index Terms
- We Have an HPC System: Now What?
Recommendations
From BigDog to BigDawg: Transitioning an HPC Cluster for Sustainability
PEARC '19: Proceedings of the Practice and Experience in Advanced Research Computing on Rise of the Machines (learning)This paper relates the experiences of managing the transition of a high performance computing (HPC) cluster from Rocks, SGE, and Cisco to OpenHPC, SLURM, and Dell. This transition was made because of sustainability issues related to security, the ...
Implementation of Simple XSEDE-Like Clusters: Science Enabled and Lessons Learned
XSEDE16: Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at ScaleThe Extreme Science and Engineering Discovery Environment (XSEDE) has created a suite of software that is collectively known as the XSEDE-Compatible Basic Cluster (XCBC). It is designed to enable smaller, resource-constrained research groups or ...
XCBC and XNIT - Tools for Cluster Implementation and Management in Research and Training
CLUSTER '15: Proceedings of the 2015 IEEE International Conference on Cluster ComputingThe Extreme Science and Engineering Discovery Environment has created a suite of software collectively known as the XSEDE-compatible basic cluster (XCBC). It has been distributed as a Rocks Roll for some time. The same scientific and supporting packages ...
Comments