Betr.: RE: High check latency with nagios
Cook, Garry
GWCOOK at mactec.com
Wed May 26 18:57:41 CEST 2004
I currently monitor about 200 hosts and 700 services. However, in the
past I have monitored 500 hosts and 2500 services with no issues on a
PIII 866Mhz box.
Changing the number of simultaneous checks as suggested by Nagios is a
good start. However, there are several other parameters that you may
want to look at, as described in these docs:
http://nagios.sourceforge.net/docs/1_0/checkscheduling.html
There is probably additional information relevant to your issue in the
docs and/or FAQ, do a little digging.
You may also want to look into distributed monitoring.
When responding, please email the list, and not only to me. I don't have
all the answers and there are people out there with more knowledge than
I that may be able to offer further assistance. Also, it's a good idea
to have these threads in the archives, so that maybe it will help
someone else in the future.
Garry W. Cook, CCNA
Network Infrastructure Manager
MACTEC, Inc. - http://www.mactec.com/
303.308.6228 (Office) - 720.220.1862 (Mobile)
-----Original Message-----
From: marino.simons at acerta.be [mailto:marino.simons at acerta.be]
Sent: Wednesday, May 26, 2004 1:47 AM
To: Cook, Garry
Subject: Betr.: RE: [Nagios-users] High check latency with nagios
Hi,
Thanks for your advice, but I've already tried this, nagios suggests
560 simultanous checks, I changed this value in nagios.cfg, but it
doesn't make any difference... For some reason the entire system bogs
down.
Anyway I am planning to remove 80% of the hosts from nagios, and then I
hope to see some improvement..
Just to have an idea about the size of my nagios setup, how many
hosts/services do you monitor with nagios?
Thanks®ards,
Marino Simons
Acerta ICT, Network Administrator
Tel: 016246776
"Cook, Garry" <GWCOOK at mactec.com>
24/05/2004 22:12
Aan: <marino.simons at acerta.be>
cc:
Onderwerp: RE: [Nagios-users] High check latency with
nagios
Run Nagios with the help parameter '-h'. This will give you a
description of a few other parameters that can be run. I think the one
you will be interested in is '-s', which analyzes your check scheduling
information and makes suggestions for improvement. You'll probably need
to run the command like so:
/path/to/nagios/bin/nagios -s /path/to/nagios/etc/nagios.cfg
HTH
Garry W. Cook, CCNA
Network Infrastructure Manager
MACTEC, Inc. - <http://www.mactec.com/> http://www.mactec.com/
303.308.6228 (Office) - 720.220.1862 (Mobile)
-----Original Message-----
From: nagios-users-admin at lists.sourceforge.net
[mailto:nagios-users-admin at lists.sourceforge.net] On Behalf Of
marino.simons at acerta.be
Sent: Monday, May 24, 2004 12:59 AM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] High check latency with nagios
Hi All,
We are setting up nagios to monitor our infrastructure, and we ran into
a few problems. Most of them I"ve been able to solve, thanks to
reading the mailinglist. But the latest problem is a persistent one.
I'm running nagios on Suse Enterprise server 9, with kernel 2.6.5 on a
dual 2.4 ghz intel Xeon server with hyperthreading enabled, the system
has 2GB ram. We recompiled the kernel with SMP-support, en it detects
4 cpu's.. In nagios we defined 405 servers, and we do 6855 active
checks, and 17 passive checks. And we have an extremely bad
performance.
At the tactical overview I see the following information: check
latency: 4468.402 sec. It takes nagios about 1,5 hour to see a status
change. Needless to say that this is not acceptable.
Anyway I am looking for some hints, does anybode have an idea on what
causes this behavior?
Thanks in advance!!
Marino
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20040526/a0cf0ce6/attachment.html>
More information about the Users
mailing list