Search Exchange

Search All Sites

Nagios Live Webinars

Let our experts show you how Nagios can help your organization.

Contact Us

Phone: 1-888-NAGIOS-1
Email: sales@nagios.com

Login

Remember Me

Directory Tree

check_hadoop_jobtrackers

Current Version
0.3
Last Release Date
2011-11-27
Compatible With
  • Nagios 3.x
License
GPL
Hits
82756
Files:
FileDescription
check_hadoop_jobtracker.plcheck_hadoop_jobtracker.pl
Nagios CSP

Meet The New Nagios Core Services Platform

Built on over 25 years of monitoring experience, the Nagios Core Services Platform provides insightful monitoring dashboards, time-saving monitoring wizards, and unmatched ease of use. Use it for free indefinitely.

Monitoring Made Magically Better

  • Nagios Core on Overdrive
  • Powerful Monitoring Dashboards
  • Time-Saving Configuration Wizards
  • Open Source Powered Monitoring On Steroids
  • And So Much More!
Improved script to check apache hadoop jobtracers.
Will parse the admin-webpage, and check number of trackers, blacklisted nodes, heap size and more.
Will output performancedata for nodes, blacklisted nodes, running jobs (map and reduce), capacity jobs (map/reduce) and heap size (used/max)
I made this little script, to parse output from the adminpage, normally found at http://jobtracker.company.com:50030/jobtracker.jsp
It will check for status, and number of machines, and also give performanceoutput.
Nagios-config would look something like this;

define service{
use generic_service
service_description Jobtracker
check_command check_jobtracker
hosts jobtracker.company.com
}

define command{
command_name check_jobtracker
command_line $USER1$/check_hadoop_jobtracker.pl -H jobtracker.company.com -w 50 -c 40 -b 1 -p 50030
}

Plugin takes these options:

check_hadoop_jobtracker v. 0.3
Copyright (c) 2011 Jon Ottar Runde, jru@rundeconsult.no
See http://www.rundeconsult.no/?p=66 for updated versions and documentation
Usage: -w -c -H -p [-v version] [-h help]

Checks several Hadoop hdfs-parameters
-H (--Host)
-p (--Port)
-w (--warning) = warning limit number of machines
-c (--critical) = critical limit number of machines (w > c )
-b (--blacklist) = Number of blacklisted nodes for CRITICAL warning (default=1)
-h (--help)
-v (--version)