ABSTRACT
One of the most demanding needs in cloud computing is that of having scalable and highly available databases. One of the ways to attend these needs is to leverage the scalable replication techniques developed in the last decade. These techniques allow increasing both the availability and scalability of databases. Many replication protocols have been proposed during the last decade. The main research challenge was how to scale under the eager replication model, the one that provides consistency across replicas. In this paper, we examine three eager database replication systems available today: Middle-R, C-JDBC and MySQL Cluster using TPC-W benchmark. We analyze their architecture, replication protocols and compare the performance both in the absence of failures and when there are failures.
- Mysql 5.1 reference manual. http://docs.oracle.com/cd/E17952\_01/refman-5.1-en/refman-5.1-en.pdf. Accessed: 2014-06-23.Google Scholar
- TPC Benchmark#8482;W. http://www.tpc.org/tpcw/spec/tpcwv2.pdf, 2003. Accessed: 2014-06-23.Google Scholar
- Mysql proxy guide. http://downloads.mysql.com/docs/mysql-proxy-en.pdf, 2013. Accessed: 2014-06-23.Google Scholar
- Y. Amir and C. Tutu. From total order to database replication. In International Conference on Distributed Computing Systems (ICDCS), pages 494--, 2002. Google ScholarDigital Library
- C. Amza, A. L. Cox, and W. Zwaenepoel. Distributed versioning: Consistent replication for scaling back-end databases of dynamic content web sites. In Middleware, pages 282--304. Springer, 2003. Google ScholarDigital Library
- H. Berenson, P. A. Bernstein, J. Gray, J. Melton, E. J. O'Neil, and P. E. O'Neil. A critique of ansi sql isolation levels. In ACM SIGMOD International Conference on Management Of Data, pages 1--10, 1995. Google ScholarDigital Library
- J. M. Bernabe-Gisbert, F. D. Muñoz Escoi, V. Zuikeviciute, and F. Pedone. A probabilistic analysis of snapshot isolation with partial replication. In Symposium on Reliable Distributed Systems (SRDS), pages 249--258. IEEE, 2008. Google ScholarDigital Library
- E. Cecchet, G. Candea, and A. Ailamaki. Middleware-based database replication: The gaps between theory and practice. CoRR, 2007.Google Scholar
- E. Cecchet, J. Marguerite, and W. Zwaenepoel. C-jdbc: Flexible database clustering middleware. In USENIX Annual Technical Conference, FREENIX Track, pages 9--18. USENIX, 2004. Google ScholarDigital Library
- E. Cecchet, J. Marguerite, and W. Zwaenepoel. Raidb: Redundant array of inexpensive databases. In International Symposium Parallel and Distributed Processing and Applications (ISPA), volume 3358, pages 115--125. Springer, 2004. Google ScholarDigital Library
- G. Chockler, I. Keidar, and R. Vitenberg. Group communication specifications: a comprehensive study. ACM Computing Surveys, pages 427--469, 2001. Google ScholarDigital Library
- A. Correia, J. Pereira, L. Rodrigues, N. Carvalho, R. Vilaca, R. Oliveira, and S. Guedes. Gorda: An open architecture for database replication. In Network Computing and Applications (NCA), pages 287--290. IEEE, 2007.Google ScholarCross Ref
- Daudjee, Khuzaima, and K. Salem. Lazy database replication with snapshot isolation. In International Conference on Very Large Data Bases (VLDB), pages 715--726, 2006. Google ScholarDigital Library
- Elnikety, S. G. Dropsho, and F. Pedone. Tashkent: Uniting durability with transaction ordering for high-performance scalable database replication. In ACMSIGOPS/EuroSys European Conference on Computer Systems, pages 117--130, 2006. Google ScholarDigital Library
- S. Elnikety, F. Pedone, and W. Zwaenepoel. Database replication using generalized snapshot isolation. In Symposium on Reliable Distributed Systems (SRDS), pages 73--84. IEEE, 2005. Google ScholarDigital Library
- J. Gray, P. Helland, P. E. O'Neil, and D. Shasha. The Dangers of Replication and a Solution. In ACM SIGMOD International Conference on Management Of Data, pages 173--182, 1996. Google ScholarDigital Library
- J. Holliday, D. Agrawal, and A. El Abbadi. Partial database replication using epidemic communication. In International Conference on Distributed Computing Systems (ICDCS), pages 485--493, 2002. Google ScholarDigital Library
- R. Jiménez-Peris, M. Patiño-Martínez, and G. Alonso. Non-intrusive, parallel recovery of replicated data. In Symposium on Reliable Distributed Systems (SRDS), pages 150--159. IEEE, 2002. Google ScholarDigital Library
- B. Kemme and G. Alonso. A new approach to developing and implementing eager database replication protocols. ACM Transactions on Database Systems, pages 333--379, 2000. Google ScholarDigital Library
- S. Nicolas, S. Rodrigo, and P. Fernando. Brief announcement: Optimistic algorithms for partial database replication. In S. Dolev, editor, Distributed Computing, pages 557--559. Springer, 2006. Google ScholarDigital Library
- E. Pacitti, C. Coulon, P. Valduriez, and M. T. Özsu. Preventive replication in a database cluster. Distributed and Parallel Databases, pages 223--251, 2005. Google ScholarDigital Library
- M. Patiño-Martínez, R. Jiménez-Peris, B. Kemme, and G. Alonso. Scalable replication in database clusters. In International Conference on Distributed Computing (DISC), pages 315--329. Springer, 2000. Google ScholarDigital Library
- M. Patiño-Martínez, R. Jiménez-Peris, B. Kemme, and G. Alonso. MIDDLE-R: Consistent database replication at the middleware level. ACM Transactions on Computer Systems (TOCS), 23:375--423, 2005. Google ScholarDigital Library
- Plattner, Christian, G. Alonso, and M. T. Özsu. Dbfarm: a scalable cluster for multiple databases. In Proceedings of the ACM/IFIP/USENIX 2006 International Conference on Middleware, pages 180--200. Springer-Verlag, 2006. Google ScholarDigital Library
- U. Rohm, K. Bohm, H. J. Schek, and H. Schuldt. (FAS) - A Freshness-Sensitive Coordination Middleware for a Cluster of OLAP Components. In International Conference on Very Large Data Bases (VLDB), pages 754--765, 2002. Google ScholarDigital Library
- M. Wiesmann, F. Pedone, A. Schiper, B. Kemme, and G. Alonso. Database replication techniques: a three parameter classification. In Symposium on Reliable Distributed Systems (SRDS), pages 206--215. IEEE, 2000. Google ScholarDigital Library
Index Terms
- Performance evaluation of database replication systems
Recommendations
Database replication in large scale systems: optimizing the number of replicas
EDBT/ICDT '09: Proceedings of the 2009 EDBT/ICDT WorkshopsIn distributed systems, replication is used for ensuring availability and increasing performances. However, the heavy workload of distributed systems such as web2.0 applications or Global Distribution Systems, limits the benefit of replication if its ...
Comments