host checks not running until services are restored
frank
ratty at they.org
Fri Apr 8 22:24:11 CEST 2005
Nagios 2.0b2 on Debian Sarge. Been running for about a month.
All my hosts use check_icmp (symlinked as check_host) for HOST checks.
Service checks for this particular host are done over SNMP.
I had a host go down last night at 1:16am. When it was rebooted, the SNMP
daemon was not restarted because I had failed to add it to the system
startup scripts. My fault, of course. So I would expect all the _service_
checks to fail in this case, and they did.
What confuses me is that the HOST checks (ICMP) did not run at all until I
restarted snmpd, which allowed the service checks to complete properly again.
It appears that the host checks ran 10 times at 1:16am (per our global
host config), sent out the "host down" alert, and then slept for over 9.5
hours while the SNMP daemon was down. Meanwhile, service checks continued
to run and return "UNKNOWN" values because of their inability to contact
snmpd.
Is this expected behavior? Is it because the service checks return UNKNOWN
instead of CRITICAL? I thought that when a service check returns a non-OK
result, the proper action is to re-execute the host check. Is this
incorrect?
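
To make my assumption concrete, here is a minimal Python sketch of the rule I
expected. It only models my expectation, not what Nagios actually does
internally. The numeric codes are the standard plugin return codes; the
function name expects_host_recheck is made up for illustration.

```python
# Standard Nagios plugin return codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def expects_host_recheck(service_state: int) -> bool:
    """Hypothetical helper modelling the assumption in this message.

    Assumption (not confirmed Nagios behavior): any non-OK service
    result, including UNKNOWN, should trigger a fresh host check.
    """
    return service_state != OK


if __name__ == "__main__":
    states = [("OK", OK), ("WARNING", WARNING),
              ("CRITICAL", CRITICAL), ("UNKNOWN", UNKNOWN)]
    for name, state in states:
        # Show, per state, whether my assumed rule would re-run the host check.
        print(f"{name}: host re-check expected = {expects_host_recheck(state)}")
```

Under this assumed rule, UNKNOWN is treated the same as CRITICAL, which is why
I would not have expected the host checks to stay idle while snmpd was down.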
TIA
-Frank
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null