skip to main content
Skip header Section
Monitoring with GangliaNovember 2012
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

Written by Ganglia designers and maintainers, this book shows you how to collect and visualize metrics from clusters, grids, and cloud infrastructures at any scale. Want to track CPU utilization from 20,000 hosts every ten seconds? Ganglia is just the tool you need, once you know how its main components work together. This hands-on book helps experienced system administrators take advantage of Ganglia 3.x. Learn how to extend the base set of metrics you collect, fetch current values, see aggregate views of metrics, and observe time-series trends in your data. Youll also examine real-world case studies of Ganglia installs that feature challenging monitoring requirements. Determine whether Ganglia is a good fit for your environment Learn how Ganglias gmond and gmetad daemons build a metric collection overlay Plan for scalability early in your Ganglia deployment, with valuable tips and advice Take data visualization to a new level with gweb, Ganglias web frontend Write plugins to extend gmonds metric-collection capability Troubleshoot issues you may encounter with a Ganglia installation Integrate Ganglia with the sFlow and Nagios monitoring systems Contributors include: Robert Alexander, Jeff Buchbinder, Frederiko Costa, Alex Dean, Dave Josephsen, Peter Phaal, and Daniel Pocock.

Cited By

  1. ACM
    Poppe O, Lei C, Rundensteiner E and Maier D Event Trend Aggregation Under Rich Event Matching Semantics Proceedings of the 2019 International Conference on Management of Data, (555-572)
  2. Adekoya O, Sabiu H, Eager D, Grassmann W and Makaroff D A case study of spark resource configuration and management for image processing applications Proceedings of the 28th Annual International Conference on Computer Science and Software Engineering, (18-29)
  3. ACM
    Yasay J Web Server Utilization Using Common off-the-shelf Computers Proceedings of the 4th International Conference on Industrial and Business Engineering, (174-180)
  4. ACM
    Langin C We Have an HPC System Proceedings of the Practice and Experience in Advanced Research Computing 2017 on Sustainability, Success and Impact, (1-8)
  5. García-Saiz D, Zorrilla M and Bosque J (2017). A clustering-based knowledge discovery process for data centre infrastructure management, The Journal of Supercomputing, 73:1, (215-226), Online publication date: 1-Jan-2017.
  6. ACM
    Canuto M, Bosch R, Macias M and Guitart J A methodology for full-system power modeling in heterogeneous data centers Proceedings of the 9th International Conference on Utility and Cloud Computing, (20-29)
  7. ACM
    Sarajlic S, Edirisinghe N, Lukinov Y, Walters M, Davis B and Faroux G Orion Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale, (1-5)
  8. ACM
    Gómez-Iglesias A, Rosales C and Evans T Practical Monitoring of Resource Utilization for HPC Applications Proceedings of the XSEDE16 Conference on Diversity, Big Data, and Science at Scale, (1-8)
  9. ACM
    Birngruber E, Forai P and Zauner A Total recall Proceedings of the Second International Workshop on HPC User Support Tools, (1-12)
  10. ACM
    Ardagna C, Asal R, Damiani E and Vu Q (2015). From Security to Assurance in the Cloud, ACM Computing Surveys, 48:1, (1-50), Online publication date: 29-Sep-2015.
  11. Bychkov I, Oparin G, Novopashin A and Sidorov I Agent-Based Approach to Monitoring and Control of Distributed Computing Environment Proceedings of the 13th International Conference on Parallel Computing Technologies - Volume 9251, (253-257)
  12. ACM
    Zhang H, Diao Y and Immerman N On complexity and optimization of expensive queries in complex event processing Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, (217-228)
Contributors
  • University of California, Berkeley
  • InMon Corporation

Recommendations