Notification did not execute
Patrick Morris
patrick.morris at hp.com
Thu Aug 30 20:27:20 CEST 2012
On 8/29/2012 3:54 AM, Net Warrior wrote:
> Cannot do that cuz it-s a production server, I have a testing server
> where I test the alarms with the same configuration and it works
> If I find something I will let you know.
>
> Thanks
>
>
> 2012/8/28 Travis Runyard <travisrunyard at gmail.com>:
>> That is really strange. Have you tried changing its ip to an invalid one and
>> see if you can recreate the problem?
>>
>> On Aug 28, 2012 5:29 PM, "Net Warrior" <netwarrior863 at gmail.com> wrote:
>>> Thanks for your advice, I know what you mean, but this is not the case
>>> unfortunately, In the logs I can see the 3 time check was reached and
>>> the alarm did not fired, now the server is up and I did not get the
>>> notification the server is back on line either.
>>>
>>> Is critical and not SOFT for the host and the service
>>> [1346193488] HOST ALERT: hostname;DOWN;HARD;3;(Host Check Timed Out)
>>> [1346193328] SERVICE ALERT: hostname;SSH;CRITICAL;HARD;3;CRITICAL -
>>> Socket timeout after 30 seconds
>>>
>>> [1346198377] HOST ALERT: hostname;UP;SOFT;3;PING OK - Packet loss =
>>> 0%, RTA = 4.84 ms
>>>
>>> This is weird.
>>>
>>> Thanks for your time and support
>>> Regards
>>>
>>>
>>> 2012/8/28 Travis Runyard <travisrunyard at gmail.com>:
>>>> Your should look at the log before the service notification was
>>>> suppressed
>>>> because the host was down. I'm 90% sure the reason why your host down
>>>> notification was not sent was because the host was in a soft state type
>>>> and
>>>> came back online before it was determined to be in a hard state. And you
>>>> didn't post your linux-server template config so you will have to check
>>>> your
>>>> and retry_interval definition.
>>>>
>>>> For my host objects I set a retry_interval of 1 and max_check_attemps of
>>>> 2
>>>> in the template so I get notified a little quicker when a host goes
>>>> down.
>>>>
>>>> http://nagios.sourceforge.net/docs/3_0/objectdefinitions.html
>>>>
>>>> -Travis Runyard
>>>>
I would also check to make sure host or global notifications were not
disabled at that point in time, and that flap detection had not kicked
in. Those are the most likely reasons I can think of for no
notification to happen even though the host was in a hard critical state.
------------------------------------------------------------------------------
Live Security Virtual Conference
Exclusive live event will cover all the ways today's security and
threat landscape has changed and how IT managers can respond. Discussions
will include endpoint security, mobile security and the latest in malware
threats. http://www.accelacomm.com/jaw/sfrnl04242012/114/50122263/
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list