Strange problem

Javier Castillo Alcibar Javier.Castillo at alhambra-eidos.com
Thu Jan 20 17:14:25 CET 2005


Hello Michael,

I'm sure this is a bug (sorry, Nagios people), because I never had this problem with NetSaint, with the same number of services and hosts monitored... and I had NetSaint running on a slower machine!!

With this config, I have problems:
	- interval_length = 60 (default, it's recommended)
	- normal_check_interval = 1
	- error_check_interval = 1
	- enable_flap_detection = 1
	- max_concurrent_checks = 0

Now I have made a less aggressive configuration, and Nagios seems to be stable:
	- interval_length = 60		(default, it's recommended)
	- normal_check_interval = 3	(I don't like this; 3 minutes for some services is a lot...)
	- error_check_interval = 2    ( :( )
	- enable_flap_detection = 0   (In NetSaint I didn't have this enabled, so...)
	- max_concurrent_checks = 30  (Although "vmstat" and "top" always show me an idle machine (DL380 G3 P4 HPT + 2 GB RAM), I tried this...)
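
As a rough illustration of why the second configuration is lighter, the sketch below estimates the average check rate from interval_length and normal_check_interval, and the number of checks in flight at once as rate times average check duration. The service count and the check durations are hypothetical (the post does not give them), and this is only back-of-the-envelope arithmetic, not a description of how the Nagios scheduler works internally.

```python
def checks_per_second(n_services, interval_length, check_interval):
    """Average checks started per second if every service is checked
    once every (interval_length * check_interval) seconds."""
    return n_services / (interval_length * check_interval)


def checks_in_flight(rate_per_second, avg_duration_seconds):
    """Average number of checks running at once (rate * duration)."""
    return rate_per_second * avg_duration_seconds


# Hypothetical numbers: the post gives neither the real service count
# nor how long a check takes against a healthy or an unreachable host.
N_SERVICES = 600
HEALTHY_SECONDS = 1.0
TIMEOUT_SECONDS = 10.0

aggressive = checks_per_second(N_SERVICES, interval_length=60, check_interval=1)
relaxed = checks_per_second(N_SERVICES, interval_length=60, check_interval=3)

for name, rate in (("aggressive", aggressive), ("relaxed", relaxed)):
    print(f"{name}: {rate:.2f} checks/s, "
          f"{checks_in_flight(rate, HEALTHY_SECONDS):.1f} in flight when healthy, "
          f"{checks_in_flight(rate, TIMEOUT_SECONDS):.1f} in flight if every check times out")
```

Under these assumed numbers, the in-flight count grows with check duration, so a wave of timeouts raises it sharply. Comparing that figure with max_concurrent_checks gives a first idea of how much headroom a configuration leaves.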

	
So my situation now is: Nagios is working fine, but the configuration is not 100% right for my needs...

I hope this helps you....

Javier.



 


-----Original Message-----
From: nagios-users-admin at lists.sourceforge.net [mailto:nagios-users-admin at lists.sourceforge.net] On behalf of Michael Hüttig
Sent: Thursday, January 20, 2005 16:58
To: nagios-users at lists.sourceforge.net
CC: Daniel Maher
Subject: Re: [Nagios-users] Strange problem

On Thursday, 20 January 2005 15:12, Daniel Maher wrote:
> I had a similar problem just the other night.  One of our servers crashed
> and needed to be rebooted.  When it crashed, and before we had brought it
> back up, Nagios reported a failed ping (as it should have).
>
> The problem is that it /continued/ to report a failed ping all night, even
> when the machine had clearly come back up.  When I forced a check through
> the web interface the next morning, it cleared the state.
>
> Any ideas?

You are not alone:
nagios-1.3 from CVS shows the same problem in our environment. When
everything is all right, Nagios works great, but in a trouble situation
Nagios fails to reach a lot of hosts. I think this happens when Nagios has
to do a lot of work at the retry check interval. So I configured most hosts
with host dependencies and most services with service dependencies, but
this didn't help.

I'm still looking for hints; if you have any good ideas, please let me
know.

With best regards,

Michael
>
> Daniel Maher
> System Engineer
> ACE TECHNOLOGY INC.
>
>
> -----Original Message-----
> From: Javier Castillo Alcibar [mailto:Javier.Castillo at alhambra-eidos.com]
> Sent: January 19, 2005 3:25 AM
> To: nagios-users at lists.sourceforge.net
> Subject: [Nagios-users] Strange problem
>
> Hello all!,
>
> I have a strange problem with Nagios. When I have many hosts down
> because of a network problem, Nagios starts failing on hosts that are
> OK. Nagios always gets "Service Check Timed Out" when checking the PING
> service (check_ping or check_fping) on hosts that are working nicely.....
>
> It seems to be a bug, but I'm not sure..... with NetSaint, I never faced
> this problem....
>
>
> Any ideas?
>
> Thx in advance,
> Javier.
>
>
>
>
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
>
> ::: Please include Nagios version, plugin version (-v) and OS when
> ::: reporting any issue. Messages without supporting info will risk being
> ::: sent to /dev/null

