Using Nagios to monitor "service-less" hosts
Andy Shellam (Mailing Lists)
andy.shellam-lists at mailnetwork.co.uk
Wed Nov 8 21:58:08 CET 2006
Hi Ted,
I understand the distinction - I *did* have host checks actively
scheduled (i.e. the host parameter 'check_interval' was set to 1; it is
now 0, so host checks shouldn't be scheduled, right?). Yet Nagios IS
checking the hosts roughly every few minutes, regardless of child
service status.
Here's a dead simple example - the FH-Gateway host. It has a single
service, which is a Ping. The host also has a Ping set as its
active_check_command parameter.
Now, if I show you the service breakdown for the Ping _service_ on
FH-Gateway:
Current Status: OK
Status Information: PING OK - Packet loss = 0%, RTA = 3.02 ms
Performance Data:
Current Attempt: 1/2
State Type: HARD
Last Check Type: ACTIVE
Last Check Time: 08-11-2006 20:49:37
Status Data Age: 0d 0h 0m 51s
Next Scheduled Active Check: 08-11-2006 20:50:37
Latency: 0.607 seconds
Check Duration: 9.013 seconds
Last State Change: 08-11-2006 10:46:46
Current State Duration: 0d 10h 3m 42s
Nagios reports it's been in the same state (i.e. OK) for 10 hours, 3
minutes and 42 seconds, right?
So why was the host checked only a few seconds ago?
Host Status: UP
Status Information: PING OK - Packet loss = 0%, RTA = 0.27 ms
Performance Data:
Current Attempt: 1/2
State Type: HARD
Last Check Type: ACTIVE
Last Check Time: 08-11-2006 20:50:49
Status Data Age: 0d 0h 0m 39s
Next Scheduled Active Check: N/A
Latency: 9.113 seconds
Check Duration: 9.011 seconds
Last State Change: 07-11-2006 06:20:35
Current State Duration: 1d 14h 30m 53s
Last Host Notification: N/A
Current Notification Number: 0
Is This Host Flapping? NO
Percent State Change: 0.00%
In Scheduled Downtime? NO
Last Update: 08-11-2006 20:51:16
If the general line of thinking is correct, Nagios should have last
checked the host at (or around) 10:46 this morning, when there was a
blip in the service check. But it didn't. It does check them every 1-2
minutes.
My check_interval parameter is 0 - the config viewer in the web CGIs
shows "enabled active checks" as NO for each host.
While I've been writing this, the above host has been checked again at
20:54:49 - exactly 4 minutes after the last check. There is no change in
the service status - 10 hours, 9 minutes now.
Any ideas?
Andy.
Tedman Eng wrote:
> Host checks are not actively scheduled in normal operation.
>
> You could go months without requiring a host check, and the status age of
> the host check will show something like 81 days for example.
>
> If you see recent host checks, then that means there was a service problem
> and Nagios wanted to be sure it wasn't the host.
>
> Perhaps if you thought of "host check" as "network link status", it would
> make the distinction more clear.
>
>
>
>> -----Original Message-----
>> From: Andy Shellam (Mailing Lists)
>> [mailto:andy.shellam-lists at mailnetwork.co.uk]
>> Sent: Wednesday, November 08, 2006 11:56 AM
>> To: Sloane, Robert Raymond
>> Cc: nagios-users at lists.sourceforge.net
>> Subject: Re: [Nagios-users] Using Nagios to monitor "service-less" hosts
>>
>>
>> Sloane, Robert Raymond wrote:
>>
>>>> Last Check Time: 08-11-2006 19:34:40
>>>> Next Scheduled Active Check: N/A
>>>>
>>>>
>>> Interesting. Nagios thinks the last check was run over a month ago.
>>>
>>>
>>>
>> No, thankfully! That date is the 8th November (British format.)
>>
>>> You wouldn't see anything about hosts in the scheduling queue. Host
>>> checks are run immediately, not through the queue. That is why it is
>>> best to not use them.
>>>
>>>
>> I did when the check_interval was set to 1 in the hosts - it showed the
>> host name and a blank service column.
>> I'd mentioned this only to prove the point that the checks do not seem
>> to be scheduled any more, so I cannot figure out why it's still running
>> the host checks at (seemingly) regular intervals.
>>
>> There are no hosts under that machine (or indeed above it), and all
>> service checks are up and have been for a good 6-8 hours.
>>
>> I'm stumped!
>>
>> Andy.
>>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting
> any issue.
> ::: Messages without supporting info will risk being sent to /dev/null