ABSTRACT
The reality of multicore machines as a standard and the prevalence of distributed cloud computing has signaled a need for parallel and distributed computing to become integrated into the computer science curriculum. At the same time, operating system virtualization has become a common technique with open standard tools available to any practitioners. Virtual machines (VMs) installed on available computer lab resources can be used to simulate high-performance cluster computing environments. This paper describes two such virtual clusters in use at small colleges, demonstrates their effectiveness for parallel computing education, and provides information about how to obtain the VMs for use in an educational lab setting. We have used these clusters to introduce parallelism into several courses in our undergraduate curriculum.
- A. Gaspar, S. Langevin, and W. Armitage, "Virtualization Technologies in the Undergraduate IT Curriculum," IT Professional, vol. 9, no. 4, pp. 10--17, 2007. Google ScholarDigital Library
- A. Goscinski and A. Wong, "Performance evaluation of the concurrent execution of NAS parallel benchmarks with BYTE sequential benchmarks on a cluster," in Parallel and Distributed Systems, 2005. Proceedings. 11th International Conference on, vol. 1, pp. 313--319 Vol. 1, 2005. Google ScholarDigital Library
- M. Bergman, J. Funston, and P. Glifeather-Crowley, "Low-cost compute clusters in virtualized environments," Journal of Comuting Sciences in Colleges, vol. 25, no. 1, pp. 159--166, Oct. 2009. Google ScholarDigital Library
- R. Brown et al., "Strategies for preparing computer science students for the multicore world," in Proceedings of the 2010 ITiCSE working group reports on Working group reports - ITiCSE-WGR '10, p. 97, 2010. Google ScholarDigital Library
- NSF/IEEE-TCPP Curriculum Initiative on Parallel and Distributed Computing -- Core Topics for Undergraduates." {Online}. Available: http://www.cs.gsu.edu/~tcpp/curriculum/index.php. {Accessed: 02-Sep-2011}.Google Scholar
- R. A. Brown, "Hadoop at home: large-scale computing at a small college," in Proceedings of the 40th ACM technical symposium on Computer science education, pp. 106--110, 2009. Google ScholarDigital Library
- Apache Hadoop Project." {Online}. Available: http://hadoop.apache.org/. {Accessed: 01-Sep-2011}.Google Scholar
- M. Johnson, R. H. Liao, A. Rasmussen, R. Sridharan, D. Garcia, and B. K. Harvey, Infusing Parallelism into Introductory Computer Science Curriculum using MapReduce. EECS Department, University of California, Berkeley, 2008.Google Scholar
- P. Garrity, T. Yates, R. Brown, and E. Shoop, "WebMapReduce: an accessible and adaptable tool for teaching map-reduce computing," in Proceedings of the 42nd ACM technical symposium on Computer science education - SIGCSE '11, p. 183, 2011. Google ScholarDigital Library
- E. Johnson, P. Garrity, T. Yates, and R. Brown, "Performance of a Virtual Cluster in a General-Purpose Teaching Laboratory," in IEEE Cluster Conference, p. to appear, 2011. Google ScholarDigital Library
- P. Luszczek et al., "Introduction to the HPC Challenge Benchmark Suite," Lawrence Berkeley National Laboratory, LBNL Paper LBNL-57493, 2005. Retreived from http://www.escholarship.org/uc/item/6sv079jp, 03-Sep-2011.Google Scholar
- Open MPI: Open Source High Performance Computing." {Online}. Available: http://www.open-mpi.org/. {Accessed: 03-Sep-2011}.Google Scholar
- TORQUE Resource Manager." {Online}. Available: http://www.clusterresources.com/products/torque-resource-manager.php. {Accessed: 02-Sep-2011}.Google Scholar
- R. Rabenseifner, "Effective I/O Bandwidth (b_eff_io) Benchmark." {Online}. Available: https://fs.hlrs.de/projects/par/mpi//b_eff_io/. {Accessed: 19-Aug-2011}.Google Scholar
- R. Brown and E. Shoop, "Modules in community: injecting more parallelism into the computer science curriculum," in Proceedings of the 42nd ACM technical symposium on Computer science education - SIGCSE '11, p. 447, 2011. Google ScholarDigital Library
Index Terms
- Virtual clusters for parallel and distributed education
Recommendations
Evaluating Student Engagement towards Integrating Parallel and Distributed Computing (PDC) Topics in Undergraduate Level Computer Science Curriculum
SIGCSE '19: Proceedings of the 50th ACM Technical Symposium on Computer Science EducationThis research is a cross university study of evaluating whether there are differences between demographic factors (gender, age, class standing) and student engagement via unplugged activities introducing parallel and distributed computing (PDC) topics ...
Implementation of parallel graph algorithms on a massively parallel SIMD computer with virtual processing
IPPS '95: Proceedings of the 9th International Symposium on Parallel ProcessingWe describe our implementation, with virtual processing, of several parallel graph algorithms on a 16,384-processor MasPar MP-1. We present extensive test data on our code.
Exploiting Distributed-Memory and Shared-Memory Parallelism on Clusters of SMPs with Data Parallel Programs
Clusters of SMPs are hybrid-parallel architectures that combine the main concepts of distributed-memory and shared-memory parallel machines. Although SMP clusters are widely used in the high performance computing community, there exists no single ...
Comments