~5000 seconds for 'Active Service Latency' (could this delay my checks?)
Aaron M. Segura
aaron.segura at cabelas.com
Tue Feb 5 00:45:59 CET 2008
On Mon, 2008-02-04 at 15:32 -0800, Roger wrote:
> I am having a helluva time getting lots of SNMP checks to properly
> show up in my Nagios GUI after I create them and restart
> the /etc/init.d/nagios service. I have about 900 hosts, all of which
> are not checked (via ping) very much (like every 5 minutes). My
> services are checked about every 10 minutes, unless there is a
> problem, then I check them at 2 minute intervals.
>
> top/htop and 'uptime' show the box has very little load, but the
> following nagiostats dump shows an active service latency of almost
> 5000 seconds. Could this be why it takes forever and a day for
> checks to show up? And if so, might this be a network latency issue?
>
> Status File: /var/log/nagios/status.dat
> Status File Age: 0d 0h 0m 3s
> Status File Version: 2.10
>
> Program Running Time: 3d 4h 30m 30s
> Nagios PID: 29689
> Used/High/Total Command Buffers: 0 / 1 / 4096
> Used/High/Total Check Result Buffers: 0 / 82 / 4096
>
> Total Services: 637
> Services Checked: 637
> Services Scheduled: 637
> Active Service Checks: 637
> Passive Service Checks: 0
> Total Service State Change: 0.000 / 17.570 / 0.421 %
> Active Service Latency: 4577.550 / 4713.980 / 4634.530 sec
> Active Service Execution Time: 0.042 / 11.084 / 0.550 sec
> Active Service State Change: 0.000 / 17.570 / 0.421 %
> Active Services Last 1/5/15/60 min: 10 / 56 / 208 / 571
> Passive Service State Change: 0.000 / 0.000 / 0.000 %
> Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0
> Services Ok/Warn/Unk/Crit: 458 / 45 / 25 / 109
> Services Flapping: 0
> Services In Downtime: 0
>
> Total Hosts: 908
> Hosts Checked: 907
> Hosts Scheduled: 906
> Active Host Checks: 908
> Passive Host Checks: 0
> Total Host State Change: 0.000 / 96.120 / 5.756 %
> Active Host Latency: 0.000 / 4731.771 / 4440.847 sec
> Active Host Execution Time: 0.000 / 10.072 / 4.558 sec
> Active Host State Change: 0.000 / 96.120 / 5.756 %
> Active Hosts Last 1/5/15/60 min: 11 / 61 / 172 / 647
> Passive Host State Change: 0.000 / 0.000 / 0.000 %
> Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0
> Hosts Up/Down/Unreach: 835 / 73 / 0
> Hosts Flapping: 0
> Hosts In Downtime: 0
> -------------------------------------------------------------------------
Good lord, man! Turn off those active host checks!
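Nagios doesn't handle host checks very efficiently: in the 2.x series
(your status file says 2.10) they run serially, so regularly scheduled
host checks hold up everything else waiting in the queue. Typically,
you should only execute a host check if a service on that host fails.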
Create a service on each host that emulates your current host check and
turn off active host checking. That should fix your problem.
Sorry if I'm misreading something, but that's what it looks like to me.
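
For illustration, the suggestion above might look roughly like the
following in the object configuration. This is only a sketch under
assumptions: the host name, address, and the generic-host and
generic-service templates are placeholders, the templates are assumed
to supply the remaining required directives, and check-host-alive and
check_ping are assumed to be defined as in the sample command
configuration. It takes "turn off active host checking" to mean
dropping the regular schedule (no check_interval on the host) while
leaving the on-demand host check in place; the thresholds and
intervals are examples, not recommendations.

```
# Hypothetical host: no check_interval, so it is not polled on a schedule.
define host{
        use                     generic-host      ; assumed template
        host_name               web01             ; placeholder name
        address                 192.0.2.10        ; placeholder address
        check_command           check-host-alive  ; still run on demand
        max_check_attempts      3
        }

# Ping service standing in for the old scheduled host check.
define service{
        use                     generic-service   ; assumed template
        host_name               web01
        service_description     PING
        check_command           check_ping!100.0,20%!500.0,60%
        normal_check_interval   5                 ; example: every 5 minutes
        retry_check_interval    1
        }
```

With a layout like this the ping runs as an ordinary service check
alongside the SNMP checks, and the host check itself should only fire
when one of the host's services reports a problem.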