skip to main content
10.1145/2157136.2157310acmconferencesArticle/Chapter ViewAbstractPublication PagessigcseConference Proceedingsconference-collections
research-article

Experiences teaching MapReduce in the cloud

Published:29 February 2012Publication History

ABSTRACT

We describe our experiences teaching MapReduce in a large undergraduate lecture course using public cloud services. Using the cloud, every student could carry out scalability benchmarking assignments on realistic hardware, which would have been impossible otherwise. Over two semesters, over 500 students took our course. We believe this is the first large-scale demonstration that it is feasible to use pay-as-you-go billing in the Cloud for a large undergraduate course. Modest instructor effort was sufficient to prevent students from overspending. Average per-pupil expenses in the Cloud were under $45, less than half our available grant funding. Students were excited by the assignment: 90% said they thought it should be retained in future course offerings.

References

  1. M. Armbrust, A. Fox, R. Griffith, A. Joseph, R. Katz, A. Konwinski, G. Lee, D. Patterson, A. Rabkin, et al. Above the Clouds: A Berkeley View of Cloud Computing. Technical Report 2009--28, UC Berkeley, 2009.Google ScholarGoogle Scholar
  2. R. A. Brown. Hadoop at home: large-scale computing at a small college. In SIGCSE, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Cloudera, inc. Configuring and Running CDH Cloud Scripts. Retrieved August 31, 2011 from https://ccp.cloudera.com/display/CDH2DOC/Configuring+and+Running+CDH+Cloud+Scripts, 2011.Google ScholarGoogle Scholar
  4. A. Couch. Comp150 CPA. Retrieved August 21, 2011 from http://www.cs.tufts.edu/comp/150CPA/, 2011.Google ScholarGoogle Scholar
  5. J. Dean and S. Ghemawat. MapReduce: Simplified Data Processing on Large Clusters. Commun. ACM, Volume 51(Issue 1):107--113, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Garrity, T. Yates, R. Brown, and E. Shoop. WebMapReduce: an accessible and adaptable tool for teaching map-reduce computing. In SIGCSE, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Hirai, S. Raghavan, H. Garcia-Molina, and H. Paepcke. WebBase: A repository of web pages. In WWW, May 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Johnson, R. H. Liao, A. Rasmussen, R. Sridharan, D. D. Garcia, and B. Harvey. Infusing Parallelism into Introductory Computer Science Curriculum using MapReduce. Technical Report EECS-2008--34, UC Berkeley, 2008.Google ScholarGoogle Scholar
  9. A. Kimball, S. Michels-Slettvet, and C. Bisciglia. Cluster computing for web-scale data processing. In SIGCSE, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. Lin. Data-Intensive Information Processing Applications. Retrieved August 21, 2011 from http://www.umiacs.umd.edu/ jimmylin/cloud-2010-Spring/info.html, 2011.Google ScholarGoogle Scholar
  11. D. J. Malan. Moving cs50 into the cloud. J. Comput. Small Coll., 25:111--120, June 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank Citation Ranking: Bringing Order to the Web. Technical report, Stanford Digital Library Technologies Project, 1998.Google ScholarGoogle Scholar
  13. C. Shaoul and C. Westbury. A usenet corpus. Retrieved August 21, 2011 from http://www.psych.ualberta.ca/ westburylab/downloads/usenetcorpus.download.html, May 2011.Google ScholarGoogle Scholar

Index Terms

  1. Experiences teaching MapReduce in the cloud

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      SIGCSE '12: Proceedings of the 43rd ACM technical symposium on Computer Science Education
      February 2012
      734 pages
      ISBN:9781450310987
      DOI:10.1145/2157136

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 29 February 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      SIGCSE '12 Paper Acceptance Rate100of289submissions,35%Overall Acceptance Rate1,595of4,542submissions,35%

      Upcoming Conference

      SIGCSE Virtual 2024

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader