passive service freshness with nagios restart
    Ethan Galstad 
    nagios at nagios.org
       
    Wed Jun 20 02:04:39 CEST 2007
    
    
  
Ton Voon wrote:
> Hi Michelle,
> 
> On 19 Jun 2007, at 20:11, Michelle Craft wrote:
> 
>> It looks like it was changed because:
>>
>> http://article.gmane.org/gmane.network.nagios.devel/2438/ 
>> match=passive+freshness+restart
>>
>>>> If the master is stopped for a
>>>> long time and then restarted, the passive checks go stale at the  
>>>> next
>>>> freshness cycle because there is not enough time for the slaves to
>>>> send results back.
>> - but nagios being down for a long time and having passive checks  
>> stale
>> for awhile doesn't seem nearly as bad a problem as passive checks  
>> not ever
>> becoming stale because of restarts.
> 
> I'm the original requestor of the change. My motivation was to reduce  
> the number of stale results in a distributed environment because it  
> "looks bad". I apologise for the unintended side effects.
> 
> Rather than another configuration parameter, can I propose this?
> 
>    * If check_freshness_interval is defined, Nagios will use that  
> value and only check against the last_check_time. No "funny" stuff is  
> calculated
>    * If check_freshness_interval is not defined, then Nagios does a  
> calculation on the best value - currently it does things like using  
> check_interval, add latency time, lowers it if in a soft state. I'll  
> rework my change so that it raises the time if startup time is more  
> recent than last_check_time
> 
> So if you have a service you definitely want marked as stale after a  
> predefined period, use an absolute value (which logically makes sense  
> - if you want freshness, state the exact timeout period). Otherwise,  
> Nagios will try to calculate one which is optimised to a distributed  
> environment (which is the other major use case).
> 
> If there's a consensus to this, I'd be happy to make the change,  
> update the docs and post the patch here for approval. I'll even try  
> and add some tests so that this type of thing is flagged earlier.
> 
> Ton
> 
> http://www.altinity.com
> T: +44 (0)870 787 9243
> F: +44 (0)845 280 1725
> Skype: tonvoon
> 
Good idea.  I just added some code to the 3.0 CVS snapshot that should 
do this.  If it looks okay I can add it to the 2.x branch as well.
Ethan Galstad,
Nagios Developer
---
Email: nagios at nagios.org
Website: http://www.nagios.org
-------------------------------------------------------------------------
This SF.net email is sponsored by DB2 Express
Download DB2 Express C - the FREE version of DB2 express and take
control of your XML. No limits. Just data. Click to get it now.
http://sourceforge.net/powerbar/db2/
    
    
More information about the Developers
mailing list