<br><font size=2 face="sans-serif">Greetings,</font>
<br>
<br><font size=2 face="sans-serif">it seems I triggered a bug with our
new nagios instance, as it shows quite a strange behaviour.</font>
<br><font size=2 face="sans-serif">Quoting from the nagios 3.x documentation:
http://nagios.sourceforge.net/docs/3_0/notifications.html</font>
<br><font size=2 face="sans-serif">Service and Host Filters:</font>
<br>
<br><font size=2 face="sans-serif">"Note: Notifications about host
or service recoveries are only sent out if a notification was sent out</font>
<br><font size=2 face="sans-serif">for the original problem. It doesn't
make sense to get a recovery notification for something you never</font>
<br><font size=2 face="sans-serif">knew was a problem... "</font>
<br>
<br><font size=2 face="sans-serif">This is what happened:</font>
<br>
<br><font size=2 face="sans-serif">1. Service went CRITICAL -> Notifications
to the contacts user1-mail, user2-mail</font>
<br><font size=2 face="sans-serif">2. Service went WARNING -> Notifications
to the contacts user1-mail, user2-mail</font>
<br><font size=2 face="sans-serif">3. Service went OK -> Notifications
to the contacts user1-mail,user2-mail,user1-sms,user2-sms</font>
<br>
<br><font size=2 face="sans-serif">vmctx02 CPU
CRITICAL 18-08-2008
16:24:50 user1-mail
mail-notification CRITICAL:
15m: average load 100% critical</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
CRITICAL 18-08-2008
16:24:50 user2-mail
mail-notification CRITICAL:
15m: average load 100% critical</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
WARNING 18-08-2008
16:31:50 user1-mail
mail-notification WARNING:
15m: average load 99% warning</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
WARNING 18-08-2008
16:31:50 user2-mail
mail-notification WARNING:
15m: average load 99% warning</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
OK 18-08-2008
16:32:50 user1-sms
sms-notification OK:
15m: average load 92%</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
OK 18-08-2008
16:32:50 user2-sms
sms-notification OK:
15m: average load 92%</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
OK 18-08-2008
16:32:50 user1-mail
mail-notification OK:
15m: average load 92%</font>
<br><font size=2 face="sans-serif">vmctx02 CPU
OK 18-08-2008
16:32:50 user2-mail
mail-notification OK:
15m: average load 92%</font>
<br>
<br><font size=2 face="sans-serif">I do not understand why the 2 sms contacts
were notified, they never received a</font>
<br><font size=2 face="sans-serif">problem notification in first place.
It was an escalation which triggered those sms -</font>
<br><font size=2 face="sans-serif">but it shouldn't have in my opinion.
It seems it only happens in our environment, if</font>
<br><font size=2 face="sans-serif">exactly 2 notifications were sent before
a recovery.</font>
<br>
<br><font size=2 face="sans-serif">These are the relevant configs:</font>
<br>
<br>
<br><font size=2 face="sans-serif">Contacts and Templates (user1 and user2
are identical):</font>
<br>
<br>
<br><font size=2 face="sans-serif">define contact {</font>
<br><font size=2 face="sans-serif">
name
generic-contact-mail</font>
<br><font size=2 face="sans-serif">
host_notification_period 24x7</font>
<br><font size=2 face="sans-serif">
service_notification_period 24x7</font>
<br><font size=2 face="sans-serif">
host_notification_options d,r</font>
<br><font size=2 face="sans-serif">
service_notification_options u,c,w,r</font>
<br><font size=2 face="sans-serif">
host_notification_commands mail-notification</font>
<br><font size=2 face="sans-serif">
service_notification_commands mail-notification</font>
<br><font size=2 face="sans-serif">
register
0</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br><font size=2 face="sans-serif">define contact {</font>
<br><font size=2 face="sans-serif">
contact_name user1-mail</font>
<br><font size=2 face="sans-serif">
use
generic-contact-mail</font>
<br><font size=2 face="sans-serif">
alias User1</font>
<br><font size=2 face="sans-serif">
email user1@firma.com</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br><font size=2 face="sans-serif">define contact {</font>
<br><font size=2 face="sans-serif"> name
generic-contact-sms</font>
<br><font size=2 face="sans-serif"> host_notification_period
24x7</font>
<br><font size=2 face="sans-serif"> service_notification_period
24x7</font>
<br><font size=2 face="sans-serif"> host_notification_options
d,r</font>
<br><font size=2 face="sans-serif"> service_notification_options
u,c,r</font>
<br><font size=2 face="sans-serif"> host_notification_commands
sms-notification</font>
<br><font size=2 face="sans-serif"> service_notification_commands
sms-notification</font>
<br><font size=2 face="sans-serif"> register
0</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br><font size=2 face="sans-serif">define contact {
</font>
<br><font size=2 face="sans-serif"> contact_name
user1-sms</font>
<br><font size=2 face="sans-serif"> use
generic-contact-sms</font>
<br><font size=2 face="sans-serif"> alias
S R</font>
<br><font size=2 face="sans-serif"> pager
+49-DONT-CALL-ME</font>
<br><font size=2 face="sans-serif">} </font>
<br>
<br>
<br><font size=2 face="sans-serif">Service Templates and Service:</font>
<br>
<br>
<br><font size=2 face="sans-serif">define service {</font>
<br><font size=2 face="sans-serif">
name
generic-service</font>
<br><font size=2 face="sans-serif">
is_volatile
0</font>
<br><font size=2 face="sans-serif">
check_period
24x7</font>
<br><font size=2 face="sans-serif">
max_check_attempts
3</font>
<br><font size=2 face="sans-serif">
normal_check_interval
1</font>
<br><font size=2 face="sans-serif">
retry_check_interval
3</font>
<br><font size=2 face="sans-serif">
active_checks_enabled
1</font>
<br><font size=2 face="sans-serif">
passive_checks_enabled
1</font>
<br><font size=2 face="sans-serif">
parallelize_check
1</font>
<br><font size=2 face="sans-serif">
obsess_over_service
0</font>
<br><font size=2 face="sans-serif">
check_freshness
1</font>
<br><font size=2 face="sans-serif">
freshness_threshold
120</font>
<br><font size=2 face="sans-serif">
notifications_enabled
1</font>
<br><font size=2 face="sans-serif">
notification_interval
60</font>
<br><font size=2 face="sans-serif">
notification_period
24x7</font>
<br><font size=2 face="sans-serif">
notification_options
u,c,w,r</font>
<br><font size=2 face="sans-serif">
event_handler_enabled
1</font>
<br><font size=2 face="sans-serif">
flap_detection_enabled
1</font>
<br><font size=2 face="sans-serif">
process_perf_data
1</font>
<br><font size=2 face="sans-serif">
retain_status_information 1</font>
<br><font size=2 face="sans-serif">
retain_nonstatus_information 1</font>
<br><font size=2 face="sans-serif">
register
0</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br><font size=2 face="sans-serif">define service {</font>
<br><font size=2 face="sans-serif">
service_description
CPU</font>
<br><font size=2 face="sans-serif">
use
generic-service</font>
<br><font size=2 face="sans-serif">
host_name
vmctx01</font>
<br><font size=2 face="sans-serif">
check_command
check_nrpe_cpu!99%!100%</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br>
<br><font size=2 face="sans-serif">Service Escalation Templates and Escalations:
(the escalation_period at that time was workhours)</font>
<br>
<br>
<br><font size=2 face="sans-serif">define serviceescalation {</font>
<br><font size=2 face="sans-serif">
name
service-minor-nonworkhours</font>
<br><font size=2 face="sans-serif">
first_notification
4</font>
<br><font size=2 face="sans-serif">
last_notification
4</font>
<br><font size=2 face="sans-serif">
notification_interval
60</font>
<br><font size=2 face="sans-serif">
escalation_period
nonworkhours</font>
<br><font size=2 face="sans-serif">
escalation_options
r,c</font>
<br><font size=2 face="sans-serif">
register
0</font>
<br><font size=2 face="sans-serif">}
</font>
<br><font size=2 face="sans-serif">
</font>
<br><font size=2 face="sans-serif">
</font>
<br><font size=2 face="sans-serif">define serviceescalation {</font>
<br><font size=2 face="sans-serif">
name
service-minor-workhours</font>
<br><font size=2 face="sans-serif">
first_notification
2</font>
<br><font size=2 face="sans-serif">
last_notification
2</font>
<br><font size=2 face="sans-serif">
notification_interval
60</font>
<br><font size=2 face="sans-serif">
escalation_period
workhours</font>
<br><font size=2 face="sans-serif">
escalation_options
r,c</font>
<br><font size=2 face="sans-serif">
register
0</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br><font size=2 face="sans-serif">define serviceescalation {</font>
<br><font size=2 face="sans-serif">
use
service-minor-nonworkhours</font>
<br><font size=2 face="sans-serif">
host_name
essctxsir06,essctx10,essctx04,essctxulg04,essctx11,essctxulg03,essctxsir03,essctxj0</font>
<br><font size=2 face="sans-serif">1,essctxsir02,essctx03,essctxb06,essctxulg02,essctxsir05,essctxb01,essctxulg05,essctxulg01,essctx07,essctxulg06,essctxtest0</font>
<br><font size=2 face="sans-serif">1,essctxtest01a,vmctx01,vmctx02,vmctx03,vmctx05,vmnrzctxulg03,vmnrzctxulg02,vmnrzctxulg01,nrzctxsir02,nrzctxsir01,nrzctxpps</font>
<br><font size=2 face="sans-serif">02,nrzctxpps01,nrzctxpcs01,nrzctxpcs02,vmnrzctxpcs02</font>
<br><font size=2 face="sans-serif">
service_description
*</font>
<br><font size=2 face="sans-serif">
contact_groups
citrixadmins,citrixadmins-sms</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br>
<br><font size=2 face="sans-serif">define serviceescalation {</font>
<br><font size=2 face="sans-serif">
use
service-minor-workhours</font>
<br><font size=2 face="sans-serif">
host_name
essctxsir06,essctx10,essctx04,essctxulg04,essctx11,essctxulg03,essctxsir03,essctxj0</font>
<br><font size=2 face="sans-serif">1,essctxsir02,essctx03,essctxb06,essctxulg02,essctxsir05,essctxb01,essctxulg05,essctxulg01,essctx07,essctxulg06,essctxtest0</font>
<br><font size=2 face="sans-serif">1,essctxtest01a,vmctx01,vmctx02,vmctx03,vmctx05,vmnrzctxulg03,vmnrzctxulg02,vmnrzctxulg01,nrzctxsir02,nrzctxsir01,nrzctxpps</font>
<br><font size=2 face="sans-serif">02,nrzctxpps01,nrzctxpcs01,nrzctxpcs02,vmnrzctxpcs02</font>
<br><font size=2 face="sans-serif">
service_description
*</font>
<br><font size=2 face="sans-serif">
contact_groups
citrixadmins,citrixadmins-sms</font>
<br><font size=2 face="sans-serif">}</font>
<br>
<br><font size=2 face="sans-serif">-- <br>
Sascha Runschke<br>
Netzwerk- und Systemmanagement<br>
Telefon : +49 (201) 102-1879 Mobil : +49 (173) 5419665 Fax : +49 (201)
102-1102105</font>
<br>
<br>
<br>
<span style="font-family:sans-serif,helvetica; font-size:10pt; color:#000000">GFKL Financial Services AG</span><br>
<span style="font-family:sans-serif,helvetica; font-size:10pt; color:#000000">Vorstand: Dr. Peter Jänsch (Vors.), Jürgen Baltes, Dr. Till Ergenzinger, Dr. Tom Haverkamp</span><br>
<span style="font-family:sans-serif,helvetica; font-size:10pt; color:#000000">Vorsitzender des Aufsichtsrats: Dr. Georg F. Thoma</span><br>
<span style="font-family:sans-serif,helvetica; font-size:10pt; color:#000000">Sitz: Limbecker Platz 1, 45127 Essen, Amtsgericht Essen, HRB 13522</span>