How often does Nagios need restarting? (Quiscustodiet ipsos custodes?)
James Pratt
jpratt at norwich.edu
Sat Jun 20 03:23:24 CEST 2009
Hi Tom, I've tried to answer your questions to the best of my own
personal knowledge -I have replaced any of your original "*" symbols
with my own on all my comments/thoughts below, since my MS outlook
client apparently just sucks, so this appears more readable.
Regards,
jamie
-----Original Message-----
From: Kustner, Tom [mailto:Tom.Kustner at RetirementPartner.com]
Sent: Friday, June 19, 2009 5:35 PM
To: nagios-users at lists.sourceforge.net
Subject: [Nagios-users] How often does Nagios need restarting?
(Quiscustodiet ipsos custodes?)
I am a Nagios user, not the administrator. We are running Nagios 2.9 on
RHEL 4 or 5. Overall, 200+ hosts with 3000 services being monitored. I
have access for monitoring a smaller number of hosts.
* ok, understood...
In another posting, I alluded to an issue where a host had gone down but
no alert was sent out. The issue surfaced again today and as was done
the other time, Nagios was restarted to "fix" the problem. I am
naturally concerned about the unreliability.
* did you get any on-list or off-list replies at all? You have not
mentioned if you had it resolved or not, but it sound like the answer is
no to possibly both(?)
Any thoughts on this problem? Specifically:
What are best practices for making sure Nagios does not fall down on
the job? Is there something not set right?
* Understanding your setup and the way nagios works is how you ensure it
stands up... a mis-config sounds likely, but who knows...
Are other Nagios administrators restarting Nagios on a weekly or
nightly basis to keep it on the job?
* Heck no! That's why we run it on Linux or Solaris! :)
Is this an issue specific to Nagios 2.9? Was 2.9 a spotty version?
*Not to my knowledge - all stable releases have worked very reliably
here, especially 2.9 now that I look back...
For a given host, why would "active checks" be enabled, yet "N/A"
appears in the "Next Active Check" field?
* RTM - host checks are not always performed unless service checks fail,
and since I've been a manual-slacker myself, that may not even be the
true correct answer (Marc? :)
Thanks for any help.
-Tom Kustner-
* Not to sound negative/condescending or anything like that, but your
install will truly only work as well as you have maintained
it/understand it. You should really look at your current config files
and read the manual on 2.9, or upgrade to 3.x and again rtm... Also,
you have not sent anything specific related to your problematic
config(s) for anyone on this list to even guess either way whether or
not something is mis-configured. If you are concerned about posting your
configs/setup, change stuff properly to hide what you need to on-list.
(I apologize if I have missed your earlier posting. Many here try our
best to help people here when possible, but sometimes we are all busy at
the same time, who knows!?).
Cheers,
Jamie
------------------------------------------------------------------------------
Are you an open source citizen? Join us for the Open Source Bridge conference!
Portland, OR, June 17-19. Two days of sessions, one day of unconference: $250.
Need another reason to go? 24-hour hacker lounge. Register today!
http://ad.doubleclick.net/clk;215844324;13503038;v?http://opensourcebridge.org
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list