Service notifications when host is down
Quanah Gibson-Mount
quanah at stanford.edu
Thu Apr 22 17:53:32 CEST 2004
--On Thursday, April 22, 2004 9:04 AM -0400 Sean Dilda
<agrajag at dragaera.net> wrote:
> On Wed, 2004-04-21 at 19:20, Quanah Gibson-Mount wrote:
>> Quoting Ben Whaley <Benjamin.Whaley at colorado.edu>:
>>
>> > > I thought of setting a 'ping' service check on the host, and making
>> > > all other checks dependent on it, but that seems to me to be more of a
>> > > workaround than a solution, and it doesn't fully solve the scheduled
>> > > downtime problem.
>> >
>> > Yes, we had the same idea. I am currently using some other, similar
>> > workarounds to solve problems that Nagios doesn't have a solution for,
>> > but they have introduced more problems than they've fixed.
>> >
>> > What's strange about this particular case, however, is that Nagios
>> > *usually* catches it. For example, in the following sequence, the host
>> > down alert was generated before the service checks, thus avoiding the
>> > unnecessary notification:
>>
>> The consistent problem we have seen with Nagios is that once a host goes
>> down, it only ever emits a single host down alert, and does not keep
>> paging that the host is down (it correctly does not keep paging that
>> the services for the host are down). Despite querying the list, an
>> answer to this has never appeared. Perhaps it is time to file a bug for
>> this on SourceForge.
>
> The only time I ever had this problem was because my notification
> commands were broken. Nagios uses different notification commands for
> hosts and services, so your service notification command could be
> working while your host one isn't. Did you check the logs to see if
> nagios attempted a notification?
Please read what I said... We do get >one< notification. If the
notification script were broken, we wouldn't get any. The problem is that
Nagios never sends out any more notifications after the initial one.
Viewing the logs shows that Nagios does continue to query the host via the
host check command, and continues to see that it is down -- it just never
sends out further notifications that the host is down. We even verified
that Nagios was correctly checking the host by sniffing the TCP traffic
between Nagios and the host.
--Quanah
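
The log check described above could be scripted roughly as follows. This is
only a sketch, not something from the original message: it assumes the Nagios
log records host notifications as
"[epoch] HOST NOTIFICATION: contact;host;state;command;output", and the
function name and sample lines are hypothetical. It counts how many host
notifications were logged per host and state, so a count stuck at 1 for a
host that has been down for hours would match the behaviour reported here.

```python
import re
from collections import Counter

# Assumed log line shape (not confirmed by the thread):
#   [epoch] HOST NOTIFICATION: contact;host;state;command;output
NOTIFY_RE = re.compile(
    r"^\[(\d+)\] HOST NOTIFICATION: ([^;]*);([^;]*);([^;]*);"
)


def count_host_notifications(lines):
    """Return a Counter mapping (host, state) to the number of
    host notifications found in the given log lines."""
    counts = Counter()
    for line in lines:
        m = NOTIFY_RE.match(line)
        if m:
            _ts, _contact, host, state = m.groups()
            counts[(host, state)] += 1
    return counts


# Hypothetical sample data, for illustration only.
SAMPLE_LOG = [
    "[1082649000] HOST ALERT: web1;DOWN;HARD;3;CRITICAL - Host Unreachable",
    "[1082649000] HOST NOTIFICATION: admin;web1;DOWN;host-notify-by-email;"
    "CRITICAL - Host Unreachable",
    # Service notifications are deliberately ignored by the pattern above.
    "[1082649100] SERVICE NOTIFICATION: admin;web1;HTTP;CRITICAL;"
    "notify-by-email;Connection refused",
]

if __name__ == "__main__":
    for (host, state), n in count_host_notifications(SAMPLE_LOG).items():
        print(f"{host} {state}: {n} host notification(s)")
```

To run it against a real log, pass an open file object instead of SAMPLE_LOG;
the path to the log file depends on the local installation.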
--
Quanah Gibson-Mount
Principal Software Developer
ITSS/TSS/Computing Systems
ITSS/TSS/Infrastructure Operations
Stanford University
GnuPG Public Key: http://www.stanford.edu/~quanah/pgp.html
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users