ABSTRACT
The SAP HANA database is the core of SAP's new data management platform. Its overall goal is to provide a generic but powerful system for different query scenarios, both transactional and analytical, on the same data representation within a highly scalable execution environment. In this paper, we highlight the main features that differentiate the SAP HANA database from classical relational database engines. To this end, we first outline the general architecture and design criteria of the SAP HANA database. We then challenge the common belief that column store data structures are superior only for analytical workloads and are not well suited for transactional workloads. We outline the concept of record life cycle management, which uses different storage formats for the different stages of a record's life. We not only discuss the general concept but also describe in some detail how to efficiently propagate records through their life cycle and move database entries from write-optimized to read-optimized storage formats. In summary, the paper aims to illustrate how the SAP HANA database is able to work efficiently in both analytical and transactional workload environments.
Index Terms
- Efficient transaction processing in SAP HANA database: the end of a column store myth