Separate mail server problems cause Nagios to plotz (or vice versa?)
Terry Carmen
terry at cnysupport.com
Fri Jun 24 19:42:47 CEST 2011
Quoting up at 3.am:
> We have Nagios monitoring a variety of services on roughly 50
> separate servers. Several of them are mail servers, but only the
> "main" one (which hosts most of the Nagios notification recipients)
> has this problem.
>
> The mail server will start to become unresponsive to just about any
> input (but pings fine).
This is a mail server issue. You would need to determine exactly what
process(es) have become unresponsive and why.
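
One way to narrow down which service has hung is to probe each port with a short timeout while the problem is happening. The following is only a rough sketch, not a definitive tool: the service names and port numbers are assumptions (standard SMTP, LDAP and IMAP ports), and probe() and probe_all() are hypothetical helper names. A hung daemon often still accepts TCP connections but never answers, so for SMTP the sketch also waits for the greeting banner.

```python
import socket

# Assumed services and standard ports; adjust to what the server really runs.
SERVICES = {
    "smtp": (25, True),    # SMTP servers send a greeting banner on connect
    "ldap": (389, False),  # LDAP sends nothing until the client speaks
    "imap": (143, True),   # IMAP servers also greet on connect
}


def probe(host, port, timeout=5.0, expect_banner=False):
    """Try one TCP connection and report what happened.

    Returns "banner", "connected", "closed", "timeout", "refused",
    or "error: <reason>".
    """
    try:
        with socket.create_connection((host, port), timeout=timeout) as sock:
            if not expect_banner:
                return "connected"
            # The connect timeout also applies to this read.
            data = sock.recv(256)
            return "banner" if data else "closed"
    except socket.timeout:
        return "timeout"
    except ConnectionRefusedError:
        return "refused"
    except OSError as exc:
        return "error: %s" % exc


def probe_all(host, services=SERVICES, timeout=5.0):
    """Probe every configured service on host and return {name: state}."""
    return {
        name: probe(host, port, timeout, expect_banner)
        for name, (port, expect_banner) in services.items()
    }
```

Calling probe_all() with the mail server's name from the Nagios host, and again from the mail server itself against 127.0.0.1, should show whether one daemon is stuck ("timeout" while the others answer) or whether everything on the box has stalled.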
> Simultaneously, Nagios, which is on a separate server, will send out
> notifications that every service on every server is down because
> Nagios cannot reach them.
Why can't it reach them? Is your mail server also your router?
Terry
> Since almost all of the notifications go through this problem mail
> server, including those that forward to text messaging services, they
> will stop, and resume again when the mail server is either rebooted
> or otherwise brought back to life... sometimes by restarting the LDAP
> server process on it.
>
> There are perhaps a few dozen total email destinations for
> notifications. Even multiplying this by the total number of services
> that Nagios monitors, it doesn't seem likely that the volume of
> emails generated by Nagios alone would cause all this. It is a
> fairly modern, multiprocessor server (CentOS/Sendmail).
>
> Can anyone offer any insight or similar experiences?
>
> Thanks in Advance!
>
>
> ------------------------------------------------------------------------------
> All the data continuously generated in your IT infrastructure contains a
> definitive record of customers, application performance, security
> threats, fraudulent activity and more. Splunk takes this data and makes
> sense of it. Business sense. IT sense. Common sense..
> http://p.sf.net/sfu/splunk-d2d-c1
> _______________________________________________
> Nagios-users mailing list
> Nagios-users at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagios-users
> ::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
> ::: Messages without supporting info will risk being sent to /dev/null
>
--
Terry Carmen
CNY Support, LLC
Web. Database. Business.
http://www.cnysupport.com