Home Directory Plugins Clustering and High-Availability nrpe plugin: check_lsf_host.sh

Search Exchange

Search All Sites

Nagios Live Webinars

Let our experts show you how Nagios can help your organization.

Contact Us

Phone: 1-888-NAGIOS-1
Email: sales@nagios.com

Login

Remember Me

Directory Tree

nrpe plugin: check_lsf_host.sh

Rating
0 votes
Favoured:
0
Current Version
1.5
Last Release Date
2011-09-21
Compatible With
  • Nagios 3.x
License
GPL
Hits
91514
Files:
FileDescription
check_lsf_host.shcheck_lsf_host.sh
check_lsf_host.shcheck_lsf_host.sh
Nagios CSP

Meet The New Nagios Core Services Platform

Built on over 25 years of monitoring experience, the Nagios Core Services Platform provides insightful monitoring dashboards, time-saving monitoring wizards, and unmatched ease of use. Use it for free indefinitely.

Monitoring Made Magically Better

  • Nagios Core on Overdrive
  • Powerful Monitoring Dashboards
  • Time-Saving Configuration Wizards
  • Open Source Powered Monitoring On Steroids
  • And So Much More!
nrpe plugin: check_lsf_host.sh
Plugin for nagios nrpe to report on local host lsf status and performance data. Uses bhosts and lsload lsf commands. Performance data tested with pnp4nagios, which produces a time graph for each metrix in the lsloads listing.

Why is this run on the local host rather than querying each host status via the lsf master? Because it fits in nicely with the pnp4nagios architecture, which produces historical graphs on lsload data. pnp4nagios works on a per host basis.

Why is this written as a shell script rather than perl/python/c. We need to run lsf commands and we need to source lsf env vars to find the commands. These commands live in different places depending on the architecture of the system (intel/sun/powerpc) and whether its 32 or 64 bit. Thus a shell script is most portable across multiple platforms/architectures (assuming you have a bash interpreter and posix standard environment with tools like awk).

I have also written a check_lsf_master.sh which checks the master, and populates the performance data with LSF queue information.


V1.2 17 Aug 2011
Performance data output fully complient to nagios standard.
Check for lsf required daemons before checking host via lsf.

V1.3 01 Sep 2011
Can have more than one sbatchd running; change test from 'eq 1' to 'ge 1'.

V1.4 15 Sep 2011
eauth daemon does not get spun up until its required. If it dies, res restarts it. So no need to check for it.
Changed closed_full from warning to ok. Depends on how you want to interpret this. We don't want to know hosts are full; you might want to know.

V1.5 21 Sep 2011 Changed closed_Excl from warning to OK.