Network tuning for Nagios with slave servers
Andreas Ericsson
ae at op5.se
Fri Sep 7 14:03:11 CEST 2007
Wheeler, JF (Jonathan) wrote:
> Our configuration is quite large (830 hosts, 160700+ services),
You run more than 193 checks against each host? Good gods, you must
be *really* curious about the state of those hosts :)
> so have
> implemented a master/slave configuration for Nagios (the Nagios servers
> are running Linux). The master server only runs checks if a check
> becomes stale; i.e. it should have been checked by a slave but no result
> has been received, but I find that (for example), in the last days log
> there are 80,000 + warning messages saying the master has run a check
> because it has become "stale". On further investigation I find that on
> all of our 5 slaves the command netstat shows that there are a large
> number of TCP sockets in CLOSE_WAIT state (more . My question is, has
> anyone done any network tuning to improve Nagios network performance ?
>
Nope, but you could try doing
sysconf net.ipv4.tcp_fin_timeout=30
to halve the default tcp timeout in the kernel, which should reduce
the number of half-open connections you have.
--
Andreas Ericsson andreas.ericsson at op5.se
OP5 AB www.op5.se
Tel: +46 8-230225 Fax: +46 8-230231
-------------------------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc.
Still grepping through log files to find problems? Stop.
Now Search log events and configuration files using AJAX and a browser.
Download your FREE copy of Splunk now >> http://get.splunk.com/
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list