Huge Service Latencies

Ludwig Pummer Ludwig.Pummer at Copart.Com
Tue Dec 6 00:25:32 CET 2005


I should point out I'm running Nagios 1.3, which will only perform host
checks if a service check fails. I use check_ping and the 30 seconds and
6 seconds numbers were what I passed to check_ping in my
checkcommands.cfg (although it's probable check_ping wasn't waiting the
full 30 seconds, it certainly did wait longer than 6 seconds). It was in
the host template definition that I set max_check_attempts.
 
If you just want to see whether host checks are the source of your
latency, try temporarily replacing your host check command with a shell
script which does "exit 0". You will of course have to suffer through
more service notifications due to downed hosts.

--
Ludwig Pummer
System Administrator
Copart Auto Auctions



 



________________________________

	From: Blake Krone [mailto:blakekrone at gmail.com] 
	Sent: Monday, December 05, 2005 3:14 PM
	To: Ludwig Pummer
	Cc: Nagios Users
	Subject: Re: [Nagios-users] Huge Service Latencies
	
	
	I have active host checks set to 0 so that they won't actually
check unless a service fails on the host. How do you specify a recheck
in seconds? I thought minutes were all that was allowed.
	
	-Blake
	
	
	On 12/5/05, Ludwig Pummer <Ludwig.Pummer at copart.com> wrote: 

		I had large latencies caused by (1) hosts being down and
(2) a host check that took way too long. Nagios will suspend all service
checks while it performs a host check. In my case, the Nagios config I
inherited made 10 attempts at 30 seconds each attempt before declaring a
host to be down, so if one host went down, the host check took 300
seconds. I typically have 15 hosts down out of 475. I changed my host
checks to 3 attempts at 6 seconds each, and latencies went way down.
		 

		--
		Ludwig Pummer
		System Administrator
		Copart Auto Auctions
		
		

		 
		
		

________________________________

			From: nagios-users-admin at lists.sourceforge.net
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of Blake
Krone
			Sent: Monday, December 05, 2005 2:08 PM
			To: Nagios Users
			Subject: [Nagios-users] Huge Service Latencies
			
			
			
			Service Check Execution Time:    0.10 / 10.04 /
2.175 sec
			Service Check Latency:    21.25 / 483.37 /
224.970 sec
			Host Check Execution Time:    0.06 / 10.70 /
4.307 sec
			Host Check Latency:    0.00 / 529.15 / 5.137 sec

			# Active Host / Service Checks:    103 / 236
			# Passive Host / Service Checks:    0 / 1
			
			Machine is a P3 733mhz with 128mb ram running
2.6.11-gentoo-r3 kernel and Nagios 2.03b
			
			
			Any ideas as to why it has such huge latencies? 
			
			


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20051205/5415314d/attachment.html>


More information about the Users mailing list