check_griderrors

Current Version: 0.1
Last Release Date: 2014-03-14
Compatible With: Nagios 3.x
License: MIT
Check errors (and performance) of queues in SGE installations

This plugin checks the status of all queues in an SGE installation, warns when a defined number of queues are in an error state, and outputs performance data. The host on which the script runs must be a submit host or an admin host of the SGE installation, so running it through NRPE is recommended.
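The threshold behaviour described above could be sketched roughly as follows. This is not the plugin's actual code: the function name `evaluate_queue_errors`, the `queue_errors` perfdata label, and the "greater than or equal" comparison are assumptions made for illustration. Only the exit codes (0 = OK, 1 = WARNING, 2 = CRITICAL) and the `label=value;warn;crit` perfdata layout follow the general Nagios plugin convention.

```shell
#!/bin/sh
# Hypothetical helper, not taken from check_griderrors.sh.
# Maps a count of queues in error state to a Nagios status line and exit code.
# Arguments: <error_count> <warning_threshold> <critical_threshold>
evaluate_queue_errors() {
    errors=$1
    warn=$2
    crit=$3

    # Performance data in the usual "label=value;warn;crit" form.
    perfdata="queue_errors=${errors};${warn};${crit}"

    # Assumption: a threshold is reached when the count is >= its value.
    if [ "$errors" -ge "$crit" ]; then
        echo "CRITICAL - ${errors} queue(s) in error state | ${perfdata}"
        return 2
    elif [ "$errors" -ge "$warn" ]; then
        echo "WARNING - ${errors} queue(s) in error state | ${perfdata}"
        return 1
    fi

    echo "OK - ${errors} queue(s) in error state | ${perfdata}"
    return 0
}
```

Under these assumptions, calling `evaluate_queue_errors 1 1 2` would print a WARNING line and return 1, which matches the idea behind the `-w 1 -c 2` thresholds used in the examples on this page.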

============ SETUP NOTES ===============

Use NRPE.
Copy this file to a submit or admin host of your installation.
Adapt the installation paths directly below this comment.
You will need to set the base path of your SGE installation, the name of your
SGE cell and the path to the common environment settings file of your
installation.
If you have overlapping queues (i.e. some nodes belong to more than one queue),
you will have to ignore some of them, otherwise the total core counts will
be wrong.
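
As a rough illustration of the settings mentioned in these notes, the block below shows what such a configuration section might look like. All variable names, the example path `/opt/sge`, the cell name `default`, the queue names, and the `is_ignored_queue` helper are hypothetical; check the top of check_griderrors.sh for the names the script actually uses. The `common/settings.sh` location follows the usual SGE cell layout.

```shell
#!/bin/sh
# All names and values here are assumptions for illustration only.
SGE_BASE="/opt/sge"          # assumed base path of the SGE installation
SGE_CELL_NAME="default"      # assumed name of the SGE cell
# Usual location of the common environment settings file inside a cell.
SGE_SETTINGS="${SGE_BASE}/${SGE_CELL_NAME}/common/settings.sh"

# Space-separated names of overlapping queues to leave out of the core totals.
IGNORED_QUEUES="test.q debug.q"

# Source the SGE environment only if the file is readable, so this sketch
# also runs on a machine without SGE installed.
if [ -r "$SGE_SETTINGS" ]; then
    . "$SGE_SETTINGS"
fi

# Return 0 if the queue name given as $1 is on the ignore list, 1 otherwise.
is_ignored_queue() {
    for q in $IGNORED_QUEUES; do
        [ "$q" = "$1" ] && return 0
    done
    return 1
}
```

A loop over the queue list could then skip any name for which `is_ignored_queue` succeeds, so that nodes shared between queues are counted only once.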

To test, run the following on a submit host:

./check_griderrors.sh -w 1 -c 2
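
For readers who want to see how `-w` and `-c` options like these are commonly handled in a shell plugin, here is a minimal sketch using the POSIX `getopts` builtin. It is an assumption about a typical approach, not the plugin's own parser; the function name `parse_thresholds` and the variables `WARN` and `CRIT` are hypothetical. Exit code 3 is the Nagios convention for UNKNOWN.

```shell
#!/bin/sh
# Hypothetical option parser for "-w <warn> -c <crit>"; not taken from the plugin.
parse_thresholds() {
    WARN=""
    CRIT=""
    OPTIND=1    # reset so the function can be called more than once
    while getopts "w:c:" opt; do
        case "$opt" in
            w) WARN=$OPTARG ;;
            c) CRIT=$OPTARG ;;
            *) return 3 ;;    # unknown option or missing argument: UNKNOWN
        esac
    done
    # In this sketch both thresholds are required.
    [ -n "$WARN" ] && [ -n "$CRIT" ] || return 3
    return 0
}
```

A script built this way would call `parse_thresholds "$@"` near the top and exit with status 3 if the call fails.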

============= SETUP EXAMPLES =============

define command{
    command_name check_griderrors
    command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -c check_griderrors
}

nrpe.cfg:
command[check_griderrors]=/usr/lib64/nagios/plugins/check_griderrors.sh -w 1 -c 2

define service{
    use generic-service
    host_name submithost01
    service_description Check Griderrors
    check_command check_griderrors
    normal_check_interval 3
    retry_check_interval 1
}