DOI: 10.5555/1268379.1268399
Article

Linux-HA heartbeat system design

Published: 10 October 2000

ABSTRACT

One of the features most commonly identified as necessary for Linux™ to be considered "enterprise-ready" is high availability. High-availability (HA) systems provide increased service availability through clustering techniques.

HA clusters minimize availability interruptions by quickly switching services over from failed systems to working systems, providing the customer with an illusion of continuous availability. As such, high-availability features are vital to mission-critical systems. Although there are many components to a high-availability system, two of the key components are heartbeat services and cluster communication services. Heartbeat services provide notification of when nodes are working and when they fail. In the Linux-HA project, the heartbeat program provides both these services and intracluster communication services.

This paper describes the design of the heartbeat program, which is part of the High-Availability Linux Project, with particular emphasis on the rationales behind key design choices and the results obtained.
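
The abstract describes heartbeat services as providing notification of when nodes are working and when they fail. As a rough illustration of that idea only, the following minimal sketch declares a node failed once no heartbeat has been seen from it for a configurable interval. It is not taken from the heartbeat program: the class name `HeartbeatMonitor`, its methods, and the `deadtime` value are all assumptions made for this example, and timestamps are passed in explicitly so the logic is easy to follow.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class HeartbeatMonitor:
    """Illustrative timeout-based failure detector (hypothetical, not Linux-HA code)."""

    # Seconds of silence after which a node is considered failed (assumed value).
    deadtime: float = 10.0
    # Maps node name -> time of the most recent heartbeat received from it.
    last_seen: Dict[str, float] = field(default_factory=dict)

    def record(self, node: str, now: float) -> None:
        """Note that a heartbeat from `node` arrived at time `now`."""
        self.last_seen[node] = now

    def failed_nodes(self, now: float) -> List[str]:
        """Return the nodes whose last heartbeat is older than `deadtime`."""
        return sorted(
            node
            for node, seen in self.last_seen.items()
            if now - seen > self.deadtime
        )


monitor = HeartbeatMonitor(deadtime=3.0)
monitor.record("node-a", now=0.0)
monitor.record("node-b", now=0.0)
monitor.record("node-a", now=2.0)      # node-a keeps sending, node-b goes silent
print(monitor.failed_nodes(now=4.0))   # prints ['node-b']
```

In an HA cluster, a failure report like this would be the trigger for switching services from the failed node to a working one; how the heartbeat program itself detects and reports failures is covered in the paper, not in this sketch.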

Published in

ALS'00: Proceedings of the 4th annual Linux Showcase & Conference - Volume 4
October 2000
476 pages

Publisher

USENIX Association
United States