Search Exchange
Search All Sites
Nagios Live Webinars
Let our experts show you how Nagios can help your organization.Login
Directory Tree
check_equallogic
- Nagios 1.x
- Nagios 2.x
- Nagios 3.x
- Nagios 4.x
- Nagios XI
Meet The New Nagios Core Services Platform
Built on over 25 years of monitoring experience, the Nagios Core Services Platform provides insightful monitoring dashboards, time-saving monitoring wizards, and unmatched ease of use. Use it for free indefinitely.
Monitoring Made Magically Better
- Nagios Core on Overdrive
- Powerful Monitoring Dashboards
- Time-Saving Configuration Wizards
- Open Source Powered Monitoring On Steroids
- And So Much More!
http://www.claudiokuenzler.com/nagios-plugins/check_equallogic.php
** Version history **
----------------------------------------------------
20091109 Started Script programming checks:
health, disk, raid, uptime, ps, info
20091112 Added ethif, conn
20091118 Added diskusage
20091119 Bugfix on Outputs (removed Pipes)
20091121 Public Release
20091204 Bugfix (removed IP addresses)
20091206 Bugfix (removed SNMP community names)
20091222 Fixed raid, ps, health and diskusage checks when multiple
member devices exists. Mathias Sundman
20100112 Successful tests on PS5000XV - thanks to Scott Sawin
20100209 Compatibility matrix now on website (see Tested on above)
20100416 Beta Testing for rewritten ethif check
20100420 Corrected ethif output, finished new ethif check
20100526 Using proper order of snmpwalk command, thanks Roland Ripoll
20100531 Added perfdata for diskusage and connections, thx to Benoit
20100630 Corrected perfdata output (+added thresholds), thx Christian
20100809 Fixed conn type -> total of all eql members of group
20101026 /bin/sh back to /bin/bash (Ubuntu problems with /bin/dash)
20101026 Bugfix snmpwalk (vqe instead of vq), thanks Fabio Panigatti
20101102 Added fan
20101202 Added volumes (checks utilization of all volumes)
20110315 Bugfix in fan warning, diskusage output changed
20110323 Mysteriously disappeared temp check type added again
20110328 Beta Testing for etherrors check by Martin Conzelmann
20110404 Added thresholds to etherrors check by Martin Conzelmann
20110404 Bugfix in volumes check
20110407 New temp check - more information in output. M. Conzelmann
20110725 New disk check by Amir Shakoor (Bugfixes by Claudio Kuenzler)
20110804 New poolusage check by Chris Funderburg and Markus Becker
20110808 New vol check - checks single volume for utilization
20111013 Bugfix in vol check for similar vol names by Matt White
20111031 Bugfix in ethif check for int response by Francois Borlet
20120104 Bugfix in temp check if only one controller available
20120104 Bugfix in info check if only one controller available
20120123 Bugfix in volumes check
20120125 Added perfdata in volumes check, volume names now w/o quotes
20120319 Added poolconn check by Erwin Bleeker
20120330 Rewrite of poolusage (orig is now: memberusage) by E. Bleeker
20120405 Bugfix in poolusage to show result without thresholds
20120430 Added snapshots type by Roland Penner
20120503 Rewrite of info check (Fix for multiple members, FW Check)
20120815 Added percentage of raid rebuild when raid reconstructing
20120821 Minor bugfix in vol/volumes check (added space in perfdata)
20120911 Added percentage of raid rebuild when raid expanding
20120913 Bugfix in percentage output in raid check
20121204 Added percentage of raid rebuild when raid verifying
20121204 Changed raid percentage output when multiple members around
20121228 ps type now also checks for failed power supply fans
20130728 Added copy to spare raid status by Peter Lieven
20131024 Bugfix in temp check (Backplane_sensor_0 was not shown)
20131025 Optical cleanup
20131122 Bugfix in vol check when volumes spread across members
20131219 Bugfix in poolusage check when a pool was not used (0 size)
20140626 Bugfix in etherrors check
20140711 Added snmp connection check function
20150203 Bugfix in vol check in percentage calculation
20151006 Bugfix in vol check if volume not found by Stephane Loeuillet
20151126 Bugfix in memberusage and poolusage checks (missing newline) #
** Compatibility **
----------------------------------------------------
The plugin has been successfully tested on the following Dell Equallogic devices:
[Please see up to date table on http://www.claudiokuenzler.com/nagios-plugins/check_equallogic.php]
** Requirements **
----------------------------------------------------
- The following shell commands must exist and be executable by your Nagios user: snmpwalk, awk, grep, wc
- SNMP must be enabled on the Dell Equallogic device. If it is not already, enable it on the member.
** Definition of parameters **
----------------------------------------------------
-H Hostname or IP address of Equallogic to check
-C SNMP Communityname (must be at least readable)
-t Type of check you want to do (see the definition of types further down)
-v Name of single volume to check
[-w] Warning threshold (optional and only in combination with certain types)
[-c] Critical threshold (optional and only in combination with certain types)
--help Help text for correct usage of this script
** Definition of the check types **
----------------------------------------------------
conn -> Checks number of current ISCSI connections (thresholds possible)
disk -> Checks Status of all disks
diskusage -> Checks how much raid space is already used (thresholds possible)
etherrors -> Checks ethernet interfaces for ethernet packet errors
ethif -> Checks status of ethernet interfaces (thresholds possible)
fan -> Checks status of fans
health -> Checks overall health of Equallogic device
info -> Shows general information of Equallogic device
memberusage -> Shows disk utilisation of all members of the same group (thresholds possible)
poolconn -> Check highest number of ISCSI connections per pool (thresholds possible)
poolusage -> Checks utilization of pools (thresholds possible)
ps -> Checks status of power supply(ies)
raid -> Checks RAID status
snapshots -> Checks Snapshot Reserve status (warning level is taken from the equallogic volume config, critical level can be set with -c )
temp -> Checks temperature sensors
uptime -> Shows uptime of Equallogic device
vol -> Checks a single volume, must be used with -v option (thresholds possible)
volumes -> Checks utilization of all created ISCSI volumes (thresholds possible)
Thanks for the plugin! Works great on my ubuntu Nagios box. I am only having trouble with one service check that specifically check one volume. I get the following error message:
./check_equallogic.sh: line 804: / 1024: syntax error: operand expected (error token is "/ 1024")
Do you know where this error is coming from? I've tried several options I googled but I cannot get this check fixed. As this is the best plugin for equallogic I would like to get this working :).
Thanks!
Fixed in version 20150203.
If my volume is stored on more than 1 eql, it fails with the UNKNOWN error.
I've try to debug it, but I'm not good as I want in shell script. I can help you with send you any snmp info if you don't have 2 eql to made this test.
Yes, please contact me on www.claudiokuenzler.com
I see where this could be improved. Please contact me on www.claudiokuenzler.com to analyze this together.
if anyone has come across this problem like mine..I have storage group of 6 member SAN boxes..plugin monitors them well but if any member is offline it shows health OK with 5 members whereas group console shows critical errors due to member offline..Not sure if I am missing something.
You probably refer to the health check? I've actually never seen it become critical, even when a disk fails. Might as well remove that check as it doesn't seem to be a real reference...
I know, this is bugging. I've talked with one of the Engineers of the Equallogic about that 'problem' but this is the 'Equallogic design'. So every member is giving out information for all other members in the same group. Helpful for volume checks but annoying for hardware related checks like PS, I agree.
When your Equallogic boxes are setup in a group like mine are, service checks like diskstatus/powersupplies/volumes wil give you values from the group combined, even though your host ip is an interface on one of your SAN's.
i.e. diskstatus gives a value like "DISK OK 42 disks OK 4 disks spare"