Separate mail server problems cause Nagios to plotz (or vice versa?)
up at 3.am
Fri Jun 24 22:45:00 CEST 2011
>>>
>>>> Simultaneously, Nagios, which is on a separate server, will send out
>>>> notifications that every service on every server is down because
>>>> Nagios cannot reach them.
>>>
>>> Why can't it reach them? Is your mail server also your router?
>>
>> Good Gosh, no! That's why this is so puzzling.
>
> The next time it happens, unplug your mail server's network connection (it failed
> anyway). I'll bet it's flooding the network with (good?/bad?) packets and nagios
> can't get through.
We've got good switches (newer Catalysts) and we're not seeing other servers on
the same VLAN or switch affected.
> If taking the mailserver offline fixes it, at least you know where to look.
>
> I'd also check the mailserver logs. Some aren't too bright about handling bounces
> and if it's misconfigured, you can end up with an infinite number of bounce
> messages for the bounce messages.
Looking for mail loops sounds like a reasonable start. I'm not as used to
sendmail as I am to qmail, which seems to handle loop prevention a little
better, AFAICT. I posted to the list to rule out a known issue with Nagios,
which doesn't look like the problem.
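
A rough sketch of how one might look for a bounce loop in the mail log. It
counts entries with a null sender (from=<>, which is what bounce messages
use) per minute and lists the minutes that cross a threshold. The log line
layout, the function names and the threshold of 100 are all assumptions for
illustration, not anything taken from this server; adjust the pattern to
whatever the real log looks like.

```python
import re
from collections import Counter

# ASSUMPTION: syslog-style sendmail lines, for example
#   "Jun 24 03:00:01 mail sendmail[1234]: p5O101: from=<>, size=2048, ..."
# Group 1 captures the timestamp truncated to the minute.
NULL_SENDER = re.compile(r"^(\w{3}\s+\d+\s+\d{2}:\d{2}):\d{2}\s.*\bfrom=<>")


def bounces_per_minute(lines):
    """Count log entries with a null sender (bounces), keyed by minute."""
    counts = Counter()
    for line in lines:
        match = NULL_SENDER.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def suspicious_minutes(counts, threshold=100):
    """Return the minutes whose bounce count reaches the (arbitrary) threshold."""
    return sorted(minute for minute, n in counts.items() if n >= threshold)
```

Feeding it the lines of the mail log (wherever that lives on the box) and
printing suspicious_minutes() should show whether bounces spike around 3 am.
A spike alone doesn't prove a loop, but it would say where to look.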
Thanks again!
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null