Monitoring Nagios

Michael W. Lucas mwlucas at blackhelicopters.org
Wed Nov 28 15:01:25 CET 2007


On Wed, Nov 28, 2007 at 02:52:13PM +0100, Mohr James wrote:
> Hi All!
> 
> We had a problem the other day where the nagios process died and no one
> noticed it for over 30 minutes. Therefore, I was thinking about ways to
> monitor Nagios itself. One simply way would be a cronjob that checks if
> the nagios process is running and then uses yaps or something to send an
> SMS. But what about cases where the nagios process is running, but for
> whatever reason the messages are not being processed. 
> 
> I was thinking about setting up a service that is triggered by a cronjob
> and then sends a notification. This notification writes a flag file and
> every X minutes a cron job checks the age of the flag file. If the flag
> file is too old, it sends an SMS via yaps. 
> 
> Rather than re-invent the wheel, I was wondering if other people had
> already implemented something similar. 

Daemontools' supervise program will restart nagios if it crashes.

http://cr.yp.to/daemontools.html

In the event that supervise cannot restart the program, you can have
it send an email.

==ml

-- 
Michael W. Lucas 	mwlucas at BlackHelicopters.org, mwlucas at FreeBSD.org
		http://www.BlackHelicopters.org/~mwlucas/
      Now Shipping: "Absolute FreeBSD" -- http://www.AbsoluteFreeBSD.com
On 5/4/2007, the TSA kept 3 pairs of my soiled undies "for security reasons."

-------------------------------------------------------------------------
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell.  From the desktop to the data center, Linux is going
mainstream.  Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue. 
::: Messages without supporting info will risk being sent to /dev/null





More information about the Users mailing list