Random Down times?

Bill Akins Bill_Akins at emoryhealthcare.org
Fri Feb 18 19:54:20 CET 2005


Thanks for the ideas.  I'll look into the passive service check.  I just
have to convince the Citrix guys to let me install a client on them.

>>> Demetri Mouratis <dmourati at cm.math.uiuc.edu> 2/18/2005 1:28:20 PM
>>>
On Fri, 18 Feb 2005, Andreas Ericsson wrote:

> Demetri Mouratis wrote:
> > On Thu, 17 Feb 2005, Bill Akins wrote:
> >
> >
> >>Hi folks,
> >>
> >>Running Nagios 2.0b1 on RHEL3
> >>
> >>I have about 160 servers monitored with over 600 services monitored
and
> >>Nagios is working great.  I do have a problem with a pool of
Citrix
> >>servers I monitor.  Through Control-M, machines are rebooted at
random
> >>times (once no users are logged on and certain other conditions are
met)
> >>and then they are added back into the pool.
> >>
> >>My question is how can I turn off or delay alerting on unreachable
for
> >>5 minutes to accommodate the reboot?  I want to know if the machine
goes
> >>away for more than 5 minutes.  I am currently checking for several
> >>services running on these machines as well as disk space and ping.
> >
> >
> > Have your citrix servers send a passive service check to your
Nagios box
> > saying to deactivate service checks for that host.  Then have you
init
> > script, or some such on your citrix box turn checks back on once it
comes
> > back up.
> >
>
> This is a good idea, and not terribly cumbersome to implement.
>
> > That or increase your service check times or retries times to
accommodate
> > a 5 minute outage but not a 10 minute one.
>
> This won't work. Nagios will start checking host status upon soft
> service states and do ALL host checks in serial until it gets a good
> response or runs out of max_check_attempts, meaning this idea would
> effectively DOS your nagios every time a machine reboots for
> (hostobject)max_check_attempts * hostcheck_timeout seconds.

You're right.  I was thinking about my own environment where I fake
host
checks with check_dummy to avoid the problem you mention above.
---------------------------------------------------------------------
Demetri Mouratis
dmourati at linfactory.com



-------------------------------------------------------
SF email is sponsored by - The IT Product Guide
Read honest & candid reviews on hundreds of IT Products from real
users.
Discover which products truly live up to the hype. Start reading now.
http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when
reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20050218/0268df04/attachment.html>


More information about the Users mailing list