<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=utf-8">
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 3.2//EN">
<HTML>
<HEAD>
<META NAME="Generator" CONTENT="MS Exchange Server version 6.0.6249.1">
<TITLE>RE: [Nagios-users] host check interval</TITLE>
</HEAD>
<BODY dir=ltr>
<DIV>This is certainly one of those things I wish I could change easily about
Nagios. It's not uncommon for us to have 20-30 devices down on our statewide
network at any given time. This equates to 1-1.5% of the total number of devices
we are monitoring. Because Nagios goes hard and heavy on host
failures, I've had to make my check-host-alive command just a null command
so that my other checks are not delayed. Allowing a min/max host-check-interval
would be a nice addition.</DIV>
<DIV> </DIV>
<DIV>--</DIV>
<DIV>Marc</DIV>
<BLOCKQUOTE dir=ltr style="MARGIN-RIGHT: 0px">
<DIV><FONT size=2>-----Original Message----- <BR><B>From:</B> Dean Bishop
[mailto:dbishop@ehvert.com] <BR><B>Sent:</B> Wed 5/21/2003 4:41 PM
<BR><B>To:</B> 'Douglas K. Rand'; Nagios-Users <BR><B>Cc:</B>
<BR><B>Subject:</B> RE: [Nagios-users] host check
interval<BR><BR></FONT></DIV>
<P><FONT size=2>Nope. Host checks do not have any interval. They
are run only if a service</FONT> <BR><FONT size=2>check on that host
fails. If the service check fails once, and only once,</FONT> <BR><FONT
size=2>regardless of retry interval and max check, the host is checked using
the</FONT> <BR><FONT size=2>host's check command. There is no sense in
continuing to test the service</FONT> <BR><FONT size=2>if the host is down,
right?</FONT> </P>
<P><FONT size=2>When the host is checked, it is checked fast and furious.
The checks are run</FONT> <BR><FONT size=2>at the highest priority above
normally scheduled checks. After all, you may</FONT> <BR><FONT
size=2>have a server down and you would like to know about it right
away.</FONT> </P>
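<P><FONT size=2>The behaviour described above can be sketched as a toy model in Python. This only illustrates the description in this message and is not Nagios source code; the function name, the state labels, and the <CODE>check_host</CODE> callable are all made up for the example.</FONT> </P>
<PRE>
```python
def handle_service_result(service_ok, check_host, max_attempts):
    """Toy model of the behaviour described above, not Nagios source code.

    check_host is any callable returning True when the host answers.
    Returns a (state, attempts_used) tuple; the state labels are invented.
    """
    if service_ok:
        # No service failure, so the host check is never run.
        return ("NOT_CHECKED", 0)
    # A failed service check triggers back-to-back host checks.
    for attempt in range(1, max_attempts + 1):
        if check_host():
            return ("UP", attempt)
    return ("DOWN", max_attempts)


# A host that never answers, checked up to 5 times.
print(handle_service_result(False, lambda: False, 5))   # ('DOWN', 5)
# A passing service check leaves the host unchecked.
print(handle_service_result(True, lambda: False, 5))    # ('NOT_CHECKED', 0)
```
</PRE>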
<P><FONT size=2>Hope this helps,</FONT> <BR><FONT size=2>dean</FONT> </P>
<P><FONT size=2>-----Original Message-----</FONT> <BR><FONT size=2>From:
Douglas K. Rand [<A
href="mailto:rand@meridian-enviro.com">mailto:rand@meridian-enviro.com</A>]
</FONT><BR><FONT size=2>Sent: May 21, 2003 4:14 PM</FONT> <BR><FONT size=2>To:
Nagios-Users</FONT> <BR><FONT size=2>Subject: [Nagios-users] host check
interval</FONT> </P>
<P><FONT size=2>I'm trying to adjust the interval between host checks, and I'm
not</FONT> <BR><FONT size=2>having any luck. The directives
normal_check_interval and</FONT> <BR><FONT size=2>retry_check_interval don't
work for hosts:</FONT> </P>
<P><FONT size=2> Error: Could not add object property in file</FONT>
<BR><FONT size=2>'/usr/local/etc/nagios/hosts.cfg' on line 13.</FONT> </P>
<P><FONT size=2>And while I have host_check_timeout set to 30, it doesn't seem
to have</FONT> <BR><FONT size=2>any effect. Here is a snippet from my
nagios.log:</FONT> </P>
<P><FONT size=2>[1053541716] HOST ALERT: Hahn-VPN;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds</FONT>
<BR><FONT size=2>[1053541726] HOST ALERT: Hahn-VPN;DOWN;SOFT;2;CRITICAL - Plugin timed out after 10 seconds</FONT>
<BR><FONT size=2>[1053541736] HOST ALERT: Hahn-VPN;DOWN;SOFT;3;CRITICAL - Plugin timed out after 10 seconds</FONT>
<BR><FONT size=2>[1053541746] HOST ALERT: Hahn-VPN;DOWN;SOFT;4;CRITICAL - Plugin timed out after 10 seconds</FONT>
<BR><FONT size=2>[1053541756] HOST ALERT: Hahn-VPN;DOWN;HARD;5;CRITICAL - Plugin timed out after 10 seconds</FONT>
<BR><FONT size=2>[1053541756] HOST NOTIFICATION: rand;Hahn-VPN;DOWN;host-notify-by-email;CRITICAL - Plugin timed out after 10 seconds</FONT>
<BR><FONT size=2>[1053541757] SERVICE ALERT: Hahn-VPN;PING;CRITICAL;HARD;1;FPING CRITICAL - 192.168.2.1 (loss=100.000000% )</FONT> </P>
<P><FONT size=2>As you can see, Nagios checks the hosts every 10 seconds. That
is</FONT> <BR><FONT size=2>too quick for us; we'd like it to check every
minute, because with the</FONT> <BR><FONT size=2>10 second interval we get too
many host down notifications for our</FONT> <BR><FONT size=2>remote
systems.</FONT> </P>
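<P><FONT size=2>The spacing can be read straight off the timestamps in the log excerpt. A minimal Python sketch, assuming the bracketed number at the start of each line is a Unix epoch timestamp; the helper name <CODE>host_alert_gaps</CODE> is made up for illustration.</FONT> </P>
<PRE>
```python
import re

# HOST ALERT lines copied from the log excerpt in this message.
LOG = """\
[1053541716] HOST ALERT: Hahn-VPN;DOWN;SOFT;1;CRITICAL - Plugin timed out after 10 seconds
[1053541726] HOST ALERT: Hahn-VPN;DOWN;SOFT;2;CRITICAL - Plugin timed out after 10 seconds
[1053541736] HOST ALERT: Hahn-VPN;DOWN;SOFT;3;CRITICAL - Plugin timed out after 10 seconds
[1053541746] HOST ALERT: Hahn-VPN;DOWN;SOFT;4;CRITICAL - Plugin timed out after 10 seconds
[1053541756] HOST ALERT: Hahn-VPN;DOWN;HARD;5;CRITICAL - Plugin timed out after 10 seconds
"""

# Assumption: the bracketed prefix is seconds since the Unix epoch.
ALERT = re.compile(r"^\[(\d+)\] HOST ALERT: ([^;]+);")


def host_alert_gaps(log_text, host):
    """Return the seconds between consecutive HOST ALERT lines for one host."""
    stamps = []
    for line in log_text.splitlines():
        match = ALERT.match(line)
        if match and match.group(2) == host:
            stamps.append(int(match.group(1)))
    return [later - earlier for earlier, later in zip(stamps, stamps[1:])]


gaps = host_alert_gaps(LOG, "Hahn-VPN")
print(gaps)        # [10, 10, 10, 10]
print(sum(gaps))   # 40 seconds from the first SOFT alert to the HARD state
```
</PRE>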
<P><FONT size=2>The docs say:</FONT> </P>
<P><FONT size=2>    Also of note - when Nagios is checking the status of a
host, it holds</FONT> <BR><FONT size=2> off on doing anything else
(executing new service checks, processing</FONT> <BR><FONT size=2> other
service check results, etc). This can slow things down a bit</FONT> <BR><FONT
size=2> and cause pending service checks to be delayed for a while, but
it</FONT> <BR><FONT size=2> is necessary to determine the status of the
host before Nagios can</FONT> <BR><FONT size=2> take any further action
on the service(s) that are having problems. </FONT></P>
<P><FONT size=2>It isn't clear to me: if it is checking a host, does it hold
off on</FONT> <BR><FONT size=2>ALL other service checks, or just service
checks to that host (and</FONT> <BR><FONT size=2>perhaps child hosts)?</FONT>
</P>
<P><FONT size=2>If it holds off on ALL other service checks, I've got to find
another</FONT> <BR><FONT size=2>way to do this; I can't have no service checks
happening for 5</FONT> <BR><FONT size=2>minutes. </FONT></P><BR>
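<P><FONT size=2>The five-minute figure presumably comes from simple arithmetic: five attempts, as in the log, spaced one minute apart. A rough Python sketch, assuming host checks run back to back and block everything else for their full duration; <CODE>blocked_seconds</CODE> is a hypothetical helper, not anything in Nagios.</FONT> </P>
<PRE>
```python
def blocked_seconds(attempts, seconds_per_attempt, hosts_down=1):
    """Worst-case time spent in host checks if nothing else runs meanwhile.

    Assumes every attempt takes the full seconds_per_attempt and that
    down hosts are checked one after another (a simplification).
    """
    return attempts * seconds_per_attempt * hosts_down


# Five attempts that each time out after 10 seconds, as in the log.
print(blocked_seconds(5, 10))   # 50
# The same five attempts at the desired one-minute spacing.
print(blocked_seconds(5, 60))   # 300, i.e. 5 minutes
```
</PRE>
<P><FONT size=2>Whether the stall applies to all service checks or only to those on the affected host is exactly the open question here; the sketch only shows the upper bound if it applies to all of them.</FONT> </P>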
<P><FONT size=2>-------------------------------------------------------</FONT>
<BR><FONT size=2>This SF.net email is sponsored by: ObjectStore.</FONT>
<BR><FONT size=2>If flattening out C++ or Java code to make your application
fit in a</FONT> <BR><FONT size=2>relational database is painful, don't do it!
Check out ObjectStore.</FONT> <BR><FONT size=2>Now part of Progress Software.
<A
href="http://www.objectstore.net/sourceforge">http://www.objectstore.net/sourceforge</A></FONT>
<BR><FONT size=2>_______________________________________________</FONT>
<BR><FONT size=2>Nagios-users mailing list</FONT> <BR><FONT
size=2>Nagios-users@lists.sourceforge.net</FONT> <BR><FONT size=2><A
href="https://lists.sourceforge.net/lists/listinfo/nagios-users">https://lists.sourceforge.net/lists/listinfo/nagios-users</A></FONT>
<BR><FONT size=2>::: Please include Nagios version, plugin version (-v) and OS
when reporting</FONT> <BR><FONT size=2>any issue. </FONT><BR><FONT size=2>:::
Messages without supporting info will risk being sent to /dev/null</FONT>
</P><BR>
</BLOCKQUOTE>
</BODY>
</HTML>