<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;}
@page Section1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
{page:Section1;}
-->
</style>
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal>We have a QA group overseas that works on our customer
sites during the US overnight. To avoid false alerts, I added a time
exception so that notifications are not sent between 4am and 5:30am. The problem
is that once the exception window has passed, Nagios (3.0.3) does not resume
notifications or checks for any site that uses the exception. If a site goes
critical either shortly after 4 or (if they start early) right before 4,
checks do not continue after 5:30. When I look at Nagios later, it shows
the service as critical, with the last check done at 3:58am and the next check
scheduled for midnight the next day.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Let me give some more specific examples:<o:p></o:p></p>
<p class=MsoNormal>Server-A is running abc.customer.com for us, and our QA group
takes the site down at 3:55am, before the 4am exception. Nagios shows the
service as critical until either midnight the next day or until someone forces
a check on the service. So if I look at it at 8am, the service is critical with
the last check at 3:55am and the next scheduled check at 12am tomorrow. Once I
force a check, it resumes its normal check schedule and sends a notification
that the service is OK.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Server-B is also running a site, and Tomcat is stopped at
4:10am. This service is checked 24x7, but its notification period is the same
time period with the exception from 4am to 5:30am. After the exception window
it does not send notifications. At 8am it is still running checks and
reporting critical, but the service details show that no notifications have
been sent. Forcing a check does not trigger a notification either. If I
restart Nagios, the next check sends the first notification. I do not
see anything wrong with my time period, so I am not sure where the issue is.
Has anyone else run into this before?<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Here are the time period, command, and service definitions
for the above examples:<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>define timeperiod{<o:p></o:p></p>
<p class=MsoNormal> timeperiod_name url-monitor<o:p></o:p></p>
<p class=MsoNormal> alias url-monitor<o:p></o:p></p>
<p class=MsoNormal> sunday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> monday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> tuesday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> wednesday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> thursday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> friday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> saturday 00:00-23:59<o:p></o:p></p>
<p class=MsoNormal> exclude recycle<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>define timeperiod{<o:p></o:p></p>
<p class=MsoNormal> timeperiod_name recycle<o:p></o:p></p>
<p class=MsoNormal> alias recycle<o:p></o:p></p>
<p class=MsoNormal> sunday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> monday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> tuesday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> wednesday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> thursday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> friday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> saturday 04:00-05:30<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
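<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>As a possible workaround, the same window can be written
without the exclude directive by listing two time ranges per day. This is only
a sketch: the name url-monitor-split is hypothetical, it has not been tested
against 3.0.3, and the 24:00 end time is an assumption that differs from the
23:59 used above.<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal># Hypothetical alternative to url-monitor: no exclude, two ranges per day<o:p></o:p></p>
<p class=MsoNormal>define timeperiod{<o:p></o:p></p>
<p class=MsoNormal> timeperiod_name url-monitor-split<o:p></o:p></p>
<p class=MsoNormal> alias url-monitor-split<o:p></o:p></p>
<p class=MsoNormal> sunday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> monday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> tuesday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> wednesday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> thursday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> friday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> saturday 00:00-04:00,05:30-24:00<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>The services below would then reference url-monitor-split in
check_period and notification_period in place of url-monitor.<o:p></o:p></p>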
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>define command{<o:p></o:p></p>
<p class=MsoNormal> command_name check_http_abc<o:p></o:p></p>
<p class=MsoNormal> command_line $USER1$/check_http -H abc.company.com<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>define service{<o:p></o:p></p>
<p class=MsoNormal> use generic-service <o:p></o:p></p>
<p class=MsoNormal> host_name Server-A<o:p></o:p></p>
<p class=MsoNormal> service_description site abc<o:p></o:p></p>
<p class=MsoNormal> is_volatile 0<o:p></o:p></p>
<p class=MsoNormal> check_period url-monitor<o:p></o:p></p>
<p class=MsoNormal> max_check_attempts 2<o:p></o:p></p>
<p class=MsoNormal> normal_check_interval 5<o:p></o:p></p>
<p class=MsoNormal> retry_check_interval 5<o:p></o:p></p>
<p class=MsoNormal> contacts nagiosadmin<o:p></o:p></p>
<p class=MsoNormal> notification_interval 30<o:p></o:p></p>
<p class=MsoNormal> notification_period url-monitor<o:p></o:p></p>
<p class=MsoNormal> notification_options w,c,r<o:p></o:p></p>
<p class=MsoNormal> check_command check_http_abc<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>define service{<o:p></o:p></p>
<p class=MsoNormal style='text-indent:.5in'>use local-service<o:p></o:p></p>
<p class=MsoNormal style='text-indent:.5in'>host_name Server-B<o:p></o:p></p>
<p class=MsoNormal style='text-indent:.5in'>service_description HTTP<o:p></o:p></p>
<p class=MsoNormal> check_period 24x7<o:p></o:p></p>
<p class=MsoNormal> max_check_attempts 2<o:p></o:p></p>
<p class=MsoNormal> normal_check_interval 3<o:p></o:p></p>
<p class=MsoNormal> retry_check_interval 5<o:p></o:p></p>
<p class=MsoNormal> contacts nagiosadmin<o:p></o:p></p>
<p class=MsoNormal> notification_interval 60<o:p></o:p></p>
<p class=MsoNormal> notification_period url-monitor<o:p></o:p></p>
<p class=MsoNormal> notification_options w,c,r<o:p></o:p></p>
<p class=MsoNormal> check_command check_http<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
</div>
</body>
</html>