skip to main content
research-article

Can the elephants handle the NoSQL onslaught?

Published:01 August 2012Publication History
Skip Abstract Section

Abstract

In this new era of "big data", traditional DBMSs are under attack from two sides. At one end of the spectrum, the use of document store NoSQL systems (e.g. MongoDB) threatens to move modern Web 2.0 applications away from traditional RDBMSs. At the other end of the spectrum, big data DSS analytics that used to be the domain of parallel RDBMSs is now under attack by another class of NoSQL data analytics systems, such as Hive on Hadoop. So, are the traditional RDBMSs, aka "big elephants", doomed as they are challenged from both ends of this "big data" spectrum? In this paper, we compare one representative NoSQL system from each end of this spectrum with SQL Server, and analyze the performance and scalability aspects of each of these approaches (NoSQL vs. SQL) on two workloads (decision support analysis and interactive data-serving) that represent the two ends of the application spectrum. We present insights from this evaluation and speculate on potential trends for the future.

References

  1. CouchDB. http://couchdb.apache.org/Google ScholarGoogle Scholar
  2. Hadoop. http://hadoop.apache.org/Google ScholarGoogle Scholar
  3. Hive. http://hive.apache.org/Google ScholarGoogle Scholar
  4. Hive Issue 2081. https://issues.apache.org/jira/browse/HIVE-2081Google ScholarGoogle Scholar
  5. Hive Issue 2130. https://issues.apache.org/jira/browse/HIVE-2130Google ScholarGoogle Scholar
  6. Microsoft SQL Server 2008 R2 Parallel Data Warehouse. http://www.microsoft.com/sqlserver/en/us/solutions-technologies/data-warehousing/pdw.aspxGoogle ScholarGoogle Scholar
  7. MongoDB. http://www.mongodb.org/Google ScholarGoogle Scholar
  8. MongoDB -- Replica Sets. http://www.mongodb.org/display/DOCS/Replica+SetsGoogle ScholarGoogle Scholar
  9. MongoDB - Splitting Chunk Shards. http://www.mongodb.org/display/DOCS/Splitting+Shard+ChunksGoogle ScholarGoogle Scholar
  10. MongoDB - Mongostat. http://www.mongodb.org/display/DOCS/mongostatGoogle ScholarGoogle Scholar
  11. Riak. http://wiki.basho.com/Google ScholarGoogle Scholar
  12. Running TPC-H queries on Hive. https://issues.apache.org/jira/browse/HIVE-600Google ScholarGoogle Scholar
  13. The TPC-H Benchmark. http://www.tpc.org/tpch/Google ScholarGoogle Scholar
  14. B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears. Benchmarking Cloud Serving Systems with YCSB. In SoCC, pages 143--154, 2010. Google ScholarGoogle Scholar
  15. M. Y. Eltabakh, Y. Tian, F. Özcan, Rainer Gemulla, Aljoscha Krettek, John McPherson: CoHadoop: Flexible Data Placement and Its Exploitation in Hadoop. PVLDB 4(9): 575--585, 2011. Google ScholarGoogle Scholar
  16. A. Floratou, J. M. Patel, E. J. Shekita, and S. Tata. Column-Oriented Storage Techniques for MapReduce. PVLDB, 4(7): 419--429, 2011. Google ScholarGoogle Scholar
  17. Y. He, R. Lee, Y. Huai, Z. Shao, N. Jain, X. Zhang, and Z. Xu. RCFile: A Fast and Space-efficient Data Placement Structure in MapReduce-based Warehouse Systems. In ICDE, pages 1199--1208, 2011. Google ScholarGoogle Scholar
  18. T.Kaldewey, E. J. Shekita, and S. Tata. Clydesdale: Structured Data Processing on MapReduce. In EDBT, pages 15--25, 2012. Google ScholarGoogle Scholar
  19. A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. DeWitt, S. Madden, and M. Stonebraker. A Comparison of Approaches to Large-Scale Data Analysis. In SIGMOD, pages 165--178, 2009. Google ScholarGoogle Scholar
  20. A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka, N. Zhang, S. Antony, H. Liu, and R. Murthy. Hive: A Petabyte Scale Data Warehouse Using Hadoop. In ICDE, pages 996--1005, 2010.Google ScholarGoogle Scholar

Index Terms

  1. Can the elephants handle the NoSQL onslaught?
    Index terms have been assigned to the content through auto-classification.

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image Proceedings of the VLDB Endowment
      Proceedings of the VLDB Endowment  Volume 5, Issue 12
      August 2012
      340 pages

      Publisher

      VLDB Endowment

      Publication History

      • Published: 1 August 2012
      Published in pvldb Volume 5, Issue 12

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader