Latency ok but last check of service hours old?

sum sum at outblaze.com
Fri Sep 19 05:01:31 CEST 2003


Brad & Matt

I always have these problems, and tried to change all configuration, but 
i always found that the service reaper isn't updating the status 
properly. I setup it for distributed monitoring. Three servers are 
distributed servers. It checks about 2222 services for 207 hosts.

~ /var/log/messsgaes result
Sep 19 02:03:18 monitor nsca[11965]: SERVICE CHECK -> Host Name: 
'authbackup4.us4', Service Description: 'FS', Return Code: '1', Output: 
'<b>/var S(96%) I(2%)</b><br><b><a 
HREF=http://operations.outblaze.com/Action/ActFS.html>Action</a></b>'
Sep 19 02:03:39 monitor nagios: EXTERNAL COMMAND: 
PROCESS_SERVICE_CHECK_RESULT;authbackup4.us4;FS;1;<b>/var S(96%) 
I(2%)</b><br><b><a 
HREF=http://operations.outblaze.com/Action/ActFS.html>Action</a></b>

~ status.log
[1063939166] 
SERVICE;authbackup4.us4;FS;WARNING;3/3;HARD;1063884102;1063941400;PASSIVE;0;1;1;1063778378;0;WARNING;5089;0;155860;4822;1063867491;4;1;760;0;1;0;0.00;0;1;1;1;<b>/var 
S(96%) I(2%)</b><br><b><a 
HREF=http://operations.outblaze.com/Action/ActFS.html>Action</a></b>

~ Service State Information
Current Status:                WARNING
Status Information:           /var S(96%) I(2%)
Current Attempt:              3/3
State Type:                     HARD
Last Check Type:             PASSIVE
Last Check Time:             18-09-2003 11:21:42
Status Data Age:             0d 14h 48m 47s
Next Scheduled Active Check:    N/A
Latency:                        N/A
Check Duration:              < 1 second
Last State Change:          17-09-2003 05:59:38
Current State Duration:     1d 20h 10m 51s
Last Service Notification:   18-09-2003 06:44:51
Current Notification Number:    4
Is This Service Flapping?       N/A
Percent State Change:           N/A
In Scheduled Downtime?          NO
Last Update:                    19-09-2003 02:10:27

~ Program-Wide Performance Information
Passive Checks: Time FrameChecks Completed
<= 1 minute:5 (0.6%)
<= 5 minutes:14 (1.6%)
<= 15 minutes:117 (13.3%)
<= 1 hour:528 (60.1%)
Since program start:  517 (58.9%)

On the /var/log/messages, you can find checking result send from 
distributed servers to central. And the PROCESS_SERVICE_CHECK_RESULT  
get the result. But you will find that the last check time of this 
service has not updated. The status log file has not updated it ! Don't 
know the reason ?? Is it bugs ??

When i solved the service problems, the warning or critial need many 
long time to update it.

Anyone has ideas to fix it . I tried the nagios-cvs version , it does 
not work to fix it. Please give us some ideas to fix it . Thanks a lot.

Sum
outblaze

Remark, my central box is redhat 7.3, and kernel is 2.4.20-18.7smp



-------------------------------------------------------
This sf.net email is sponsored by:ThinkGeek
Welcome to geek heaven.
http://thinkgeek.com/sf
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list