check_cluster2 does not work as expected
Werner Flamme
werner.flamme at ufz.de
Mon Sep 7 11:06:08 CEST 2009
Jim Avery [07.09.2009 10:03]:
> 2009/9/5 Werner Flamme <werner.flamme at ufz.de>:
>> Hi,
>>
>> I want to check a cluster consisting of 2 nodes. The task is simple:
>> show how many nodes are up (respective down, there are two nodes).
>>
>> The command definition is:
>>
>> $USER1$/check_cluster2 -h -l $HOSTALIAS$ -w 1 -c 2 -d $ARG1$
>>
>> So, the host alias of the cluster will be the label, the plugin should
>> give a "warning" when 1 node is down, and should cry "critical" when
>> both nodes are down.
>>
>> That's what I thought this command would do.
>>
>> And that's what I read in the mean time:
>>
>> CLUSTER OK: FW-Cluster: 1 up, 1 down, 0 unreachable
>>
>> Sorry? Why is "1 down" not seen as warning? What do I do wrong?
>>
>> TIA
>> Werner
>
>
> I'm not familiar with check_cluster2, but came across a similar
> situation when using check_cluster for a host check recently. When I
> run check_cluster --help, it tells me :-
>
> See:
> http://nagiosplug.sourceforge.net/developer-guidelines.html#THRESHOLDFORMAT
> for THRESHOLD format and examples.
>
> I found I had to use "-w 0 -c 1" to make the plugin behave how I
> wanted (warn if one host is down and critical if two are down).
Good graciuos ;-) - it never occured to me when reading
-w, --warning=THRESHOLD
Specifies the range of hosts or services in cluster that must be in
a non-OK state in order to return a WARNING status level
that a THRESHOLD of zero hosts means 1 host. But this really seems to
work, since when both nodes were down (yesterday afternoon and this
morning), the status finally changed to WARNING.
All right, I changed -w 1 -c 2 to -w 0 -c 1. I still wonder how we
managed to get the alarms on the other clusters weeks ago :-(
> Also, if you're using this as a host check (not a service check) note
> that if the host check returns a warning state, Nagios will usually
> interpret this to mean that the host is 'UP'. See
> http://nagios.sourceforge.net/docs/3_0/hostchecks.html for an
> explanation of how plugin results are interpreted for host checks in
> Nagios.
Thanks - lucklily we use a test that only returns OK and CRITICAL.
Thanks for the hint - I think one of the nodes might fall down any
minute now, I will tell if this works ;-)
Regards,
Werner
------------------------------------------------------------------------------
Let Crystal Reports handle the reporting - Free Crystal Reports 2008 30-Day
trial. Simplify your report design, integration and deployment - and focus on
what you do best, core application coding. Discover what's new with
Crystal Reports now. http://p.sf.net/sfu/bobj-july
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list