<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:x="urn:schemas-microsoft-com:office:excel" xmlns:p="urn:schemas-microsoft-com:office:powerpoint" xmlns:a="urn:schemas-microsoft-com:office:access" xmlns:dt="uuid:C2F41010-65B3-11d1-A29F-00AA00C14882" xmlns:s="uuid:BDC6E3F0-6DA3-11d1-A2A3-00AA00C14882" xmlns:rs="urn:schemas-microsoft-com:rowset" xmlns:z="#RowsetSchema" xmlns:b="urn:schemas-microsoft-com:office:publisher" xmlns:ss="urn:schemas-microsoft-com:office:spreadsheet" xmlns:c="urn:schemas-microsoft-com:office:component:spreadsheet" xmlns:oa="urn:schemas-microsoft-com:office:activation" xmlns:html="http://www.w3.org/TR/REC-html40" xmlns:q="http://schemas.xmlsoap.org/soap/envelope/" xmlns:D="DAV:" xmlns:x2="http://schemas.microsoft.com/office/excel/2003/xml" xmlns:ois="http://schemas.microsoft.com/sharepoint/soap/ois/" xmlns:dir="http://schemas.microsoft.com/sharepoint/soap/directory/" xmlns:ds="http://www.w3.org/2000/09/xmldsig#" xmlns:dsp="http://schemas.microsoft.com/sharepoint/dsp" xmlns:udc="http://schemas.microsoft.com/data/udc" xmlns:xsd="http://www.w3.org/2001/XMLSchema" xmlns:sub="http://schemas.microsoft.com/sharepoint/soap/2002/1/alerts/" xmlns:ec="http://www.w3.org/2001/04/xmlenc#" xmlns:sp="http://schemas.microsoft.com/sharepoint/" xmlns:sps="http://schemas.microsoft.com/sharepoint/soap/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:udcxf="http://schemas.microsoft.com/data/udc/xmlfile" xmlns:wf="http://schemas.microsoft.com/sharepoint/soap/workflow/" xmlns:mver="http://schemas.openxmlformats.org/markup-compatibility/2006" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns:mrels="http://schemas.openxmlformats.org/package/2006/relationships" xmlns:ex12t="http://schemas.microsoft.com/exchange/services/2006/types" xmlns:ex12m="http://schemas.microsoft.com/exchange/services/2006/messages" xmlns:Z="urn:schemas-microsoft-com:" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv=Content-Type content="text/html; charset=us-ascii">
<meta name=Generator content="Microsoft Word 12 (filtered medium)">
<style>
<!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;}
@page Section1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.Section1
{page:Section1;}
-->
</style>
<!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang=EN-US link=blue vlink=purple>
<div class=Section1>
<p class=MsoNormal>Sorry for the long subject and post. We’re running
2.10 on CentOS 5. When we acknowledge a service alert that goes into warning,
we’re not receiving an alert when it goes into critical. <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>For example: we’re monitoring the E drive on a file
server. The drive goes into a warning state, Nagios sends an alert, and an
acknowledgement is entered. Later the drive goes critical, but an alert is
never sent. Following are the relevant log entries and config files. Thanks for
the help!<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Log File:<o:p></o:p></p>
<p class=MsoNormal>E drive goes into warning<o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:10:38 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;WARNING;notify-by-epager;e:\ - total: 263.99 Gb - used:
243.89 Gb (92%) - free 20.10 Gb (8%) <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>E drive is acknowledged<o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:11:26 DataCenterMon nagios: EXTERNAL COMMAND:
ACKNOWLEDGE_SVC_PROBLEM;X;Disk Usage E Drive;2;1;1;Nagios Admin;jf <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Acknowledge is sent<o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:11:26 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;ACKNOWLEDGEMENT (WARNING);notify-by-email;e:\ - total:
263.99 Gb - used: 243.89 Gb (92%) - free 20.10 Gb (8%);Nagios Admin;jf <o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:11:27 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;ACKNOWLEDGEMENT (WARNING);notify-by-epager;e:\ - total:
263.99 Gb - used: 243.89 Gb (92%) - free 20.10 Gb (8%);Nagios Admin;jf <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>E drive goes critical no alert sent<o:p></o:p></p>
<p class=MsoNormal>Apr 30 10:07:16 DataCenterMon nagios: SERVICE ALERT: X;Disk
Usage E Drive;CRITICAL;HARD;3;e:\ - total: 263.99 Gb - used: 251.33 Gb (95%) -
free 12.67 Gb (5%) <o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:04:16 DataCenterMon nagios: EXTERNAL COMMAND:
SCHEDULE_FORCED_SVC_CHECK;X;Disk Usage E Drive;1209578654 <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Acknowledgement is removed and alert is sent.<o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:05:19 DataCenterMon nagios: EXTERNAL COMMAND:
REMOVE_SVC_ACKNOWLEDGEMENT;X;Disk Usage E Drive <o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:05:49 DataCenterMon nagios: EXTERNAL COMMAND:
SCHEDULE_FORCED_SVC_CHECK;X;Disk Usage E Drive;1209578747 <o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:05:57 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;CRITICAL;notify-by-email;e:\ - total: 263.99 Gb - used:
254.71 Gb (96%) - free 9.29 Gb (4%)<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal># Host Template for Critical Hosts -- [E]Pager and Email
Notification to x 27x7<o:p></o:p></p>
<p class=MsoNormal>define host{<o:p></o:p></p>
<p class=MsoNormal> name Critical_Host ;
The name of this host template - referenced in other host definitions, used for
template recursion/resolution<o:p></o:p></p>
<p class=MsoNormal> notifications_enabled 1 ;
Host notifications are enabled<o:p></o:p></p>
<p class=MsoNormal> event_handler_enabled 1 ;
Host event handler is enabled<o:p></o:p></p>
<p class=MsoNormal> flap_detection_enabled 1 ;
Flap detection is enabled<o:p></o:p></p>
<p class=MsoNormal> process_perf_data 1 ;
Process performance data<o:p></o:p></p>
<p class=MsoNormal> retain_status_information 1 ;
Retain status information across program restarts<o:p></o:p></p>
<p class=MsoNormal> retain_nonstatus_information 1 ;
Retain non-status information across program restarts<o:p></o:p></p>
<p class=MsoNormal> notification_period 24x7 ;
Notifies 24x365<o:p></o:p></p>
<p class=MsoNormal> notification_options d,u,r ;Down,
Up, Recovery<o:p></o:p></p>
<p class=MsoNormal> notification_interval 5 ;Sends
Page/Email every 5 minutes<o:p></o:p></p>
<p class=MsoNormal> check_command check_ping!1000.0,20%!30000.0,100% ;Warns
at 20% packet loss or round trip time > 1000 MS Critical at 100% packet loss
or 30000 MS roun trip <o:p></o:p></p>
<p class=MsoNormal> max_check_attempts 5 ;Checks
host 5 times before generating an alert<o:p></o:p></p>
<p class=MsoNormal> contact_groups x<o:p></o:p></p>
<p class=MsoNormal> register 0 ;
DONT REGISTER THIS DEFINITION - ITS NOT A REAL HOST, JUST A TEMPLATE!<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal># 'NWWEBNAS' host definition<o:p></o:p></p>
<p class=MsoNormal>define host{<o:p></o:p></p>
<p class=MsoNormal> use Critical_Host ;
Name of host template to use<o:p></o:p></p>
<p class=MsoNormal> host_name X<o:p></o:p></p>
<p class=MsoNormal> alias Production
File Server<o:p></o:p></p>
<p class=MsoNormal> address x.x.x.x<o:p></o:p></p>
<p class=MsoNormal> parents X<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal># Crtitical Service definition template<o:p></o:p></p>
<p class=MsoNormal>define service{<o:p></o:p></p>
<p class=MsoNormal> name Critical_Service ;
The 'name' of this service template, referenced in other service definitions<o:p></o:p></p>
<p class=MsoNormal> active_checks_enabled 1 ;
Active service checks are enabled<o:p></o:p></p>
<p class=MsoNormal> passive_checks_enabled 1 ;
Passive service checks are enabled/accepted<o:p></o:p></p>
<p class=MsoNormal> parallelize_check 1 ;
Active service checks should be parallelized (disabling this can lead to major
performance problems)<o:p></o:p></p>
<p class=MsoNormal> obsess_over_service 1 ;
We should obsess over this service (if necessary)<o:p></o:p></p>
<p class=MsoNormal> is_volatile 0<o:p></o:p></p>
<p class=MsoNormal> check_freshness 0 ;
Default is to NOT check service 'freshness'<o:p></o:p></p>
<p class=MsoNormal> notifications_enabled 1 ;
Service notifications are enabled<o:p></o:p></p>
<p class=MsoNormal> event_handler_enabled 1 ;
Service event handler is enabled<o:p></o:p></p>
<p class=MsoNormal> flap_detection_enabled 1 ;
Flap detection is enabled<o:p></o:p></p>
<p class=MsoNormal> process_perf_data 1 ;
Process performance data<o:p></o:p></p>
<p class=MsoNormal> retain_status_information 1 ;
Retain status information across program restarts<o:p></o:p></p>
<p class=MsoNormal> retain_nonstatus_information 1 ;
Retain non-status information across program restarts<o:p></o:p></p>
<p class=MsoNormal> event_handler_enabled 1 ;Event
handler is enabled<o:p></o:p></p>
<p class=MsoNormal> check_period 24x7_With_Maintenance_Window ;Checks
24x7x365<o:p></o:p></p>
<p class=MsoNormal> normal_check_interval 10 ;When
service is OK it will be checked every 10 minutes<o:p></o:p></p>
<p class=MsoNormal> max_check_attempts 3 ;When
service is not OK it will check 3 times before sending an alert<o:p></o:p></p>
<p class=MsoNormal> retry_check_interval 1 ;Retries
every 1 minute once service is not OK. After max_check_attempts has bee reached
it rechecks at normal_check_interval<o:p></o:p></p>
<p class=MsoNormal> notification_interval 10 ;Sends
notifications every 10 minutes<o:p></o:p></p>
<p class=MsoNormal> notification_period 24x7 ;
Notifies 24x365<o:p></o:p></p>
<p class=MsoNormal> notification_options w,u,c,r ;Sends
alerts at Warning, Unreachable, Critical and Recovery<o:p></o:p></p>
<p class=MsoNormal> contact_groups x ;Email
ISOpsOnCall and pages ISOnCallCell<o:p></o:p></p>
<p class=MsoNormal> register 0 ;
DONT REGISTER THIS DEFINITION - ITS NOT A REAL SERVICE, JUST A TEMPLATE!<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal># Service definition<o:p></o:p></p>
<p class=MsoNormal>define service{<o:p></o:p></p>
<p class=MsoNormal> use Critical_Service ;
Name of service template to use<o:p></o:p></p>
<p class=MsoNormal> host_name X<o:p></o:p></p>
<p class=MsoNormal> service_description Disk
Usage E Drive<o:p></o:p></p>
<p class=MsoNormal> check_command check_nt_disk!e!80!95
<o:p></o:p></p>
<p class=MsoNormal> }<o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Log File:<o:p></o:p></p>
<p class=MsoNormal>E drive goes into warning<o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:10:38 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;WARNING;notify-by-epager;e:\ - total: 263.99 Gb - used:
243.89 Gb (92%) - free 20.10 Gb (8%) <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>E drive is acknowledged<o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:11:26 DataCenterMon nagios: EXTERNAL COMMAND:
ACKNOWLEDGE_SVC_PROBLEM;X;Disk Usage E Drive;2;1;1;Nagios Admin;jf <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Acknowledge is sent<o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:11:26 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;ACKNOWLEDGEMENT (WARNING);notify-by-email;e:\ - total:
263.99 Gb - used: 243.89 Gb (92%) - free 20.10 Gb (8%);Nagios Admin;jf <o:p></o:p></p>
<p class=MsoNormal>Apr 29 15:11:27 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;ACKNOWLEDGEMENT (WARNING);notify-by-epager;e:\ - total:
263.99 Gb - used: 243.89 Gb (92%) - free 20.10 Gb (8%);Nagios Admin;jf <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>E drive goes critical no alert sent<o:p></o:p></p>
<p class=MsoNormal>Apr 30 10:07:16 DataCenterMon nagios: SERVICE ALERT: X;Disk
Usage E Drive;CRITICAL;HARD;3;e:\ - total: 263.99 Gb - used: 251.33 Gb (95%) -
free 12.67 Gb (5%) <o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:04:16 DataCenterMon nagios: EXTERNAL COMMAND:
SCHEDULE_FORCED_SVC_CHECK;X;Disk Usage E Drive;1209578654 <o:p></o:p></p>
<p class=MsoNormal><o:p> </o:p></p>
<p class=MsoNormal>Acknowledgement is removed and alert is sent.<o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:05:19 DataCenterMon nagios: EXTERNAL COMMAND:
REMOVE_SVC_ACKNOWLEDGEMENT;X;Disk Usage E Drive <o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:05:49 DataCenterMon nagios: EXTERNAL COMMAND:
SCHEDULE_FORCED_SVC_CHECK;X;Disk Usage E Drive;1209578747 <o:p></o:p></p>
<p class=MsoNormal>Apr 30 11:05:57 DataCenterMon nagios: SERVICE NOTIFICATION:
XX;X;Disk Usage E Drive;CRITICAL;notify-by-email;e:\ - total: 263.99 Gb - used:
254.71 Gb (96%) - free 9.29 Gb (4%)<o:p></o:p></p>
</div>
</body>
</html>