Locking problems?: Nagios 1.1 on Redhat Enterprise ES 2.1
Mark Ferlatte
ferlatte at cryptio.net
Mon Sep 29 22:47:55 CEST 2003
Keith Weinberg said on Mon, Sep 29, 2003 at 04:09:28PM -0400:
> Lock Contention?
> After running the daemon for a while it looks like we get a number of processes
> stuck in some wait loop:
> nagios 22304 1 0 16:01 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/n
> nagios 22311 1 0 16:01 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/n
> nagios 22315 1 0 16:01 ? 00:00:00 /usr/bin/nagios -d /etc/nagios/n
>
> [etc. etc. into the hundreds over time]
>
> Doing an strace of one of these processes, I see that each of the processes
> is hanging on a write:
> strace -p 22075
> write(6, "<hostname-deleted-for-security-purposes>\0\0\0"..., 504
> (Of course the "hostname deleted" is really one of our hostnames)
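What's your service_reaper_frequency set to? Sounds like it's set too high.
As I understand it, if check results aren't reaped often enough, the pipe the
check processes write their results to fills up and they block in write(),
which would match your strace.
I think the default is 10 seconds; I ended up dropping it to 2, and that pretty
much cleared up the problem.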
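
For reference, the change would look roughly like this in the main config
file (the one you pass to nagios -d). The value 2 is only what happened to
work on my install, so treat it as a starting point rather than a
recommendation, and restart the daemon after changing it:

```
# Main Nagios config file
# How often (in seconds) Nagios processes the results of finished
# service checks. A lower value drains the results more often.
service_reaper_frequency=2
```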
M