Locking problems?: Nagios 1.1 on Redhat Enterprise ES 2.1

Mark Ferlatte ferlatte at cryptio.net
Mon Sep 29 22:47:55 CEST 2003


Keith Weinberg said on Mon, Sep 29, 2003 at 04:09:28PM -0400:
> Lock Contention?
> After running the daemon for a while it looks like we get a number process
> stuck in some wait loop:
> nagios   22304     1  0 16:01 ?        00:00:00 /usr/bin/nagios -d
> /etc/nagios/n
> nagios   22311     1  0 16:01 ?        00:00:00 /usr/bin/nagios -d
> /etc/nagios/n
> nagios   22315     1  0 16:01 ?        00:00:00 /usr/bin/nagios -d
> /etc/nagios/n
> 
> [etc. etc. into the hundreds over time]
> 
> Doing an strace of one of these processes, I see that ease of the processes
> is hanging on a write:
> strace -p 22075
> write(6, "<hostname-deleted-for-security-purposes>\0\0\0"..., 504
> (Of course the "hostname deleted" is really one of our hostnames)
 
What's your server_reaper_frequency set to?  Sounds like it's set too high.

I think the default is 10 seconds; I ended up dropping it to 2, and that pretty
much cleared up the problem.

M
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 187 bytes
Desc: not available
URL: <https://www.monitoring-lists.org/archive/users/attachments/20030929/aac3eb3a/attachment.sig>


More information about the Users mailing list