ABSTRACT
One of the most commonly identified features which is felt to be necessary for LinuxTM to be considered "enterprise-ready" is High-Availability. High-Availability (HA) systems provide increased service availability through clustering techniques.
HA clusters minimize availability interruptions by quickly switching services over from failed systems to working systems, providing the customer with an illusion of continuous availability. As such, high-availability features, are vital to mission-critical systems. Although there are many components to a high-availability system, two of the key components are heartbeat services and cluster communication services. Heartbeat services provide notification of when nodes are working, and when they fail. In the Linux-HA project, the heartbeat program provides these services and intracluster communication services.
This paper describes the design of the heartbeat program which is part of the High-Availability Linux Project with particular emphasis on the rationales behind key design choices, and the results obtained.
- {Dan94} Danzig, P. B.: "Flow Control for Limited Buffer Multicast", IEEE Transactions on Software Engineering, Vol. 20, No. 1, January 1994, pp. 1-12. Google ScholarDigital Library
- {Milz99} Milz, Harald: "The Linux High Availability HOWTO". http://meta-lab.unc.edu/pub/linux/ALPHA/linux-ha/High-Availability-HOWTO.htmlGoogle Scholar
- {Mar00} Martinez, D.: http://linux-ha.org/screenshots.htmlGoogle Scholar
- {Phi98} In Search of Clusters, by Gregory F. Pfister, 2nd Edition 1998, Prentice Hall PTR.Google Scholar
- {Ram87} Ramakrishnan, S., Jain, B. N.: "A Negative Acknowledgment with Periodic Polling Protocol for Multicast over LANs". In: Proc. IEEE INFOCOM '87, March 1987, S. 502-511.Google Scholar
- {Rob00} Robertson, A. L.,: "The High-Availability Linux Project". Http://linux-ha.org/Google Scholar
- {Twe00} Tweedie, S. C.,: "Barrier Operations". http://linux-ha.org/PhaseII/WhitePapers/sct/barrier.txtGoogle Scholar
- {Vas00} Vasa, M.,: "The Linux Fail Safe Project". http://oss.sgi.com/projects/failsafe/Google Scholar
- {Wei00} Weis, R., Geyer, W, Kuhmünch, C., "Architectures for Secure Multicast Communication", In: Proc. SANE 2000 System Administration and Networking Conference, May 22-25, 2000., pp. 63-91.Google Scholar
Index Terms
- Linux-HA heartbeat system design
Recommendations
The Adaptive Heartbeat Design of High Availability RAID Dual-Controller
MUE '08: Proceedings of the 2008 International Conference on Multimedia and Ubiquitous EngineeringDesigning storage systems to provide high availability in the face of failures needs the use of various data protection techniques, such as dual-controller RAID. The failure of RAID controller may cause RAID storage system to fail to respond to ongoing ...
Design and Implementation of High Availability Distributed System Based on Multi-level Heartbeat Protocol
CASE '09: Proceedings of the 2009 IITA International Conference on Control, Automation and Systems Engineering (case 2009)Heartbeat mechanism is widely used in designing high availability distributed system. In this paper, we design a multi-level heartbeat protocol, which consists of two kinds of heartbeat protocols: multipoint heartbeat protocol and bidirectional ...
A noninvasive heartbeat, respiration, and body movement monitoring system for neonates
Long-term monitoring of heart rate, respiratory rate, and body movement in neonates would be useful for preventing illnesses, such as respiratory disorders. However, currently, there is no noninvasive monitoring system available for neonates. Therefore, ...
Comments