skip to main content
research-article
Free Access

MapReduce: a flexible data processing tool

Authors Info & Claims
Published:01 January 2010Publication History
Skip Abstract Section

Abstract

MapReduce advantages over parallel databases include storage-system independence and fine-grain fault tolerance for large jobs.

References

  1. Abouzeid, A., Bajda-Pawlikowski, K., Abadi, D.J., Silberschatz, A., and Rasin, A. HadoopDB: An architectural hybrid of MapReduce and DBMS technologies for analytical workloads. In Proceedings of the Conference on Very Large Databases (Lyon, France, 2009); http://db.cs.yale.edu/hadoopdb/ Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Aster Data Systems, Inc. In-Database MapReduce for Rich Analytics; http://www.asterdata.com/product/mapreduce.php.Google ScholarGoogle Scholar
  3. Chang, F., Dean, J., Ghemawat, S., Hsieh, W.C., Wallach, D.A., Burrows, M., Chandra, T., Fikes, A., and Gruber, R.E. Bigtable: A distributed storage system for structured data. In Proceedings of the Seventh Symposium on Operating System Design and Implementation (Seattle, WA, Nov. 6--8). Usenix Association, 2006; http://labs.google.com/papers/bigtable.html Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Dean, J. and Ghemawat, S. MapReduce: Simplified data processing on large clusters. In Proceedings of the Sixth Symposium on Operating System Design and Implementation (San Francisco, CA, Dec. 6--8). Usenix Association, 2004; http://labs.google.com/papers/mapreduce.html Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Dewitt, D. and Stonebraker, M. MapReduce: A Major Step Backwards blogpost; http://databasecolumn.vertica.com/database-innovation/mapreduce-a-major-step-backwards/Google ScholarGoogle Scholar
  6. Dewitt, D. and Stonebraker, M. MapReduce II blogpost; http://databasecolumn.vertica.com/database-innovation/mapreduce-ii/Google ScholarGoogle Scholar
  7. Ghemawat, S., Gobioff, H., and Leung, S.-T. The Google file system. In Proceedings of the 19th ACM Symposium on Operating Systems Principles (Lake George, NY, Oct. 19--22). ACM Press, New York, 2003; http://labs.google.com/papers/gfs.html Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Google. Protocol Buffers: Google's Data Interchange Format. Documentation and open source release; http://code.google.com/p/protobuf/Google ScholarGoogle Scholar
  9. Greenplum. Greenplum MapReduce: Bringing Next-Generation Analytics Technology to the Enterprise; http://www.greenplum.com/resources/mapreduce/Google ScholarGoogle Scholar
  10. Hadoop. Documentation and open source release; http://hadoop.apache.org/core/Google ScholarGoogle Scholar
  11. Hadoop. Users list; http://wiki.apache.org/hadoop/PoweredByGoogle ScholarGoogle Scholar
  12. Olston, C., Reed, B., Srivastava, U., Kumar, R., and Tomkins, A. Pig Latin: A not-so-foreign language for data processing. In Proceedings of the ACM SIGMOD 2008 International Conference on Management of Data (Auckland, New Zealand, June 2008); http://hadoop.apache.org/pig/ Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Pavlo, A., Paulson, E., Rasin, A., Abadi, D.J., DeWitt, D.J., Madden, S., and Stonebraker, M. A comparison of approaches to large-scale data analysis. In Proceedings of the 2009 ACM SIGMOD International Conference (Providence, RI, June 29--July 2). ACM Press, New York, 2009; http://database.cs.brown.edu/projects/mapreduce-vs-dbms/ Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Pike, R., Dorward, S., Griesemer, R., and Quinlan, S. Interpreting the data: Parallel analysis with Sawzall. Scientific Programming Journal, Special Issue on Grids and Worldwide Computing Programming Models and Infrastructure 13, 4, 227--298. http://labs.google.com/papers/sawzall.html Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. MapReduce: a flexible data processing tool

                          Recommendations

                          Comments

                          Login options

                          Check if you have access through your login credentials or your institution to get full access on this article.

                          Sign in

                          Full Access

                          • Published in

                            cover image Communications of the ACM
                            Communications of the ACM  Volume 53, Issue 1
                            Amir Pnueli: Ahead of His Time
                            January 2010
                            142 pages
                            ISSN:0001-0782
                            EISSN:1557-7317
                            DOI:10.1145/1629175
                            Issue’s Table of Contents

                            Copyright © 2010 ACM

                            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

                            Publisher

                            Association for Computing Machinery

                            New York, NY, United States

                            Publication History

                            • Published: 1 January 2010

                            Permissions

                            Request permissions about this article.

                            Request Permissions

                            Check for updates

                            Qualifiers

                            • research-article
                            • Popular
                            • Refereed

                          PDF Format

                          View or Download as a PDF file.

                          PDF

                          eReader

                          View online with eReader.

                          eReader

                          HTML Format

                          View this article in HTML Format .

                          View HTML Format