Strange problem
Javier Castillo Alcibar
Javier.Castillo at alhambra-eidos.com
Thu Jan 20 17:14:25 CET 2005
Hello Michael,
I'm sure this is a bug (sorry, nagios people), because I did never find this problem with netsaint, with the same number of services and hosts monitored... and I had netsaint running on an slower machine!!
With this config, I have problems:
- interval_length = 60 (default, it's recommended)
- nomal_check_interval = 1
- error_check_interval = 1
- enable_flap_detection = 1
- max_concurrent_checks = 0
Now, I made a less aggressive configuration, and nagios seems to be stable:
- interval_length = 60 (default, it's recommended)
- nomal_check_interval = 3 (I don't like, 3 minutes in some services is a lot....)
- error_check_interval = 2 ( :( )
- enable_flap_detection = 0 ( In Netsaint, I didn't have this enabled, so ....)
- max_concurrent_checks = 30 ( Although "vmstat" and "top" always show me an idle machine (DL380 G3 P4 HPT + 2Gb ram), I tried this...)
So, my situation now is: nagios working fine, but the configuration is not 100% fine for my needs...
I hope this helps you....
Javier.
-----Mensaje original-----
De: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] En nombre de Michael Hüttig
Enviado el: jueves, 20 de enero de 2005 16:58
Para: nagios-users at lists.sourceforge.net
CC: Daniel maher
Asunto: Re: [Nagios-users] Strange problem
Am Donnerstag, 20. Januar 2005 15:12 schrieb Daniel maher:
> I had a similar problem just the other night. One of our servers crashed
> and needed to be rebooted. When it crashed, and before we had brought it
> back up, Nagios reported a failed ping (as it should have).
>
> The problem is that it /continued/ to report a failed ping all night, even
> when the machine had clearly come back up. When I forced a check through
> the web interface the next morning, it cleared the state.
>
> Any ideas?
you are not alone,
using nagios-1.3 from cvs shows the same problem in our environment. If
everything is allright, nagios works great, but in trouble situation nagios fails to reach a lot of hosts. I think this happens when nagios has to work a
lot with retry-check-intervall. So i configured mostly hosts with
host-dependencies and also mostly services with service-dependencies, but this doesn´t bring me help.
I´m looking further for any hints, if you have some good ideas, please let me
know.
with best regards
Michael
>
> Daniel Maher
> System Engineer
> ACE TECHNOLOGY INC.
>
>
> -----Original Message-----
> From: Javier Castillo Alcibar [mailto:Javier.Castillo at alhambra-eidos.com]
> Sent: January 19, 2005 3:25 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Strange problem
>
> Hello all!,
>
> I have an strange problem with nagios. When I have many hosts down,
> because of a network problem, nagios starts failing with hosts that are
> ok. Nagios gets always "Service Check Timed Out" when checking service
> PING (check_ping or check_fping) in hosts that are working nicely.....
>
> It seems a bug, but I'm not sure..... with NetSaint, I did never faced
> this problem....
>
>
> Any ideas?
>
> Thx in advance,
> Javier.
>
>
>
>
> -------------------------------------------------------
> The SF.Net email is sponsored by: Beat the post-holiday blues
> Get a FREE limited edition SourceForge.net t-shirt from ThinkGeek.
> It's fun and FREE -- well, almost....http://www.thinkgeek.com/sfshirt
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
>
> ::: Please include Nagios version, plugin version (-v) and OS when
> ::: reporting any issue. Messages without supporting info will risk being
> ::: sent to /dev/null
**********************************************************************
Diese E-Mail wurde auf Viren ueberprueft.
www.mimesweeper.com
**********************************************************************
-------------------------------------------------------
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag-&-drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
-------------------------------------------------------
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag-&-drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list