Home Directory Plugins System Metrics Processes Check long running processes

Search Exchange

Search All Sites

Nagios Live Webinars

Let our experts show you how Nagios can help your organization.

Contact Us

Phone: 1-888-NAGIOS-1
Email: sales@nagios.com

Login

Remember Me

Directory Tree

Check long running processes

Rating
1 vote
Favoured:
0
Current Version
1.11
Last Release Date
2013-11-04
Compatible With
  • Nagios 3.x
Owner
License
GPL
Hits
35334
Files:
FileDescription
check_long_running_procs.shcheck_long_running_procs.sh
Nagios CSP

Meet The New Nagios Core Services Platform

Built on over 25 years of monitoring experience, the Nagios Core Services Platform provides insightful monitoring dashboards, time-saving monitoring wizards, and unmatched ease of use. Use it for free indefinitely.

Monitoring Made Magically Better

  • Nagios Core on Overdrive
  • Powerful Monitoring Dashboards
  • Time-Saving Configuration Wizards
  • Open Source Powered Monitoring On Steroids
  • And So Much More!
A plugin to check long running processes on Linux.

IMPORTANT: This script checks elapsed CPU time not elapsed run time.
This script will take multiple arguments and alert if a process has consumed more than x number of CPU days e.g:

$ check_long_running_procs.sh -w 10 -c 20
LONG RUNNING PROCESSES WARNING: there is/are 1 long running processes on `hostname`
WARNING PID 950 () from user has used 12 CPU days and has run for a total of 13 days

The script has an optional space seperated exclude list e.g

$ check_long_running_procs.sh -w 10 -c 20 -E "sge pdflush"

The script also has a hidden function which can email a user with a warning message informing them of their long running process. You can enable this by using the “-e” flag followed by your username. You will also need to set the “DOMAIN” variable to your domain name. This function assumes your email address is @ if not you might need to recode a little.

IMPORTANT: This script checks elapsed CPU time not elapsed run time. However it would be pretty easy to recode the check to report on total run time rather than CPU time.

Reviews (1)
byit-praktyk, February 5, 2020
Proposed updates
CPU_DAYS=`if [[ $UTIME != *-* ]]; then echo "0"; else echo $UTIME | awk -F"-" '{print $1}'; fi`
RUN_DAYS=`if [[ $ETIME != *-* ]]; then echo "0"; else echo $ETIME | awk -F"-" '{print $1}'; fi`