Karssen,<br> <br> Your problem appears to be process not the tool. I am not suggesting how to conduct business in your organization but here is my suggestion.<br><br> I would reach out to noc management and put together a wiki or how to for each unique possible critical alert scenario. The I would add the "notes_url" parameter of Nagios to point to that particular wiki page. <br>
<br> Only thing noc has to do click on the notes_url page and they will know exactly what to do with the alert. Thanks<br><br><div class="gmail_quote">On Mon, Feb 8, 2010 at 3:40 AM, L.C. Karssen <span dir="ltr"><<a href="mailto:lennart.karssen@snow.nl">lennart.karssen@snow.nl</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">Dear list,<br>
<br>
I'm presently working on a Nagios 3.2.0 setup that monitors approximately<br>
1000 hosts and about 5000 services. The setup doesn't make use of Nagios'<br>
notification system, instead people at a control center (NOC) use the<br>
Nagios web interface to alert the appropriate people in case of an alert.<br>
<br>
The Nagios configuration is based on a set of host (group) templates,<br>
where services are assigned to host groups. For example: the 'check_swap'<br>
service definition is associated with the host group 'all unix hosts'.<br>
<br>
The problem I'm confronted with is that the people at the NOC don't need<br>
to see all services on a given host. For example, they shouldn't call the<br>
sysadmin at night if an SSL certificate check goes into critical state<br>
because the certificate is only valid for ten more days. So we want to<br>
remove that service from their view. In the present situation contact<br>
groups (used to determine which servers are visible to which department)<br>
are added to each specific host, but according to the Nagios docs<br>
(<a href="http://nagios.sourceforge.net/docs/3_0/cgiauth.html" target="_blank">http://nagios.sourceforge.net/docs/3_0/cgiauth.html</a>) a contact group can<br>
see _all_ services on a given host if it is listed as a contract group for<br>
the host.<br>
So I decided to remove the NOC contact group from the individual host<br>
definitions and to assign the NOC contact group only to specific services.<br>
<br>
This leads me to another problem. Some service checks (i.e. host groups)<br>
are used in one department only. This works fine.<br>
However, some other service checks (like check_swap for the 'all unix<br>
hosts' hostgroup) are shared by all departments, but some departments<br>
don't want the NOC to see check_swap alerts whereas others do want to pass<br>
these alerts to NOC. It would be possible to make services with slightly<br>
different names (e.g. check_swap_dept1, check_swap_dept2), each with the<br>
correct contact group. However, that seems to be a needless increase of<br>
complexity.<br>
Another approach would be to make host groups for each department and<br>
somehow change the service contact group for each host group.<br>
Unfortunately I haven't been able to get that to work.<br>
<br>
<br>
Any suggestions would be highly appreciated.<br>
<br>
<br>
Lennart Karssen.<br>
<br>
<br>
------------------------------------------------------------------------------<br>
The Planet: dedicated and managed hosting, cloud storage, colocation<br>
Stay online with enterprise data centers and the best network in the business<br>
Choose flexible plans and management services without long-term contracts<br>
Personal 24x7 support from experience hosting pros just a phone call away.<br>
<a href="http://p.sf.net/sfu/theplanet-com" target="_blank">http://p.sf.net/sfu/theplanet-com</a><br>
_______________________________________________<br>
Nagios-users mailing list<br>
<a href="mailto:Nagios-users@lists.sourceforge.net">Nagios-users@lists.sourceforge.net</a><br>
<a href="https://lists.sourceforge.net/lists/listinfo/nagios-users" target="_blank">https://lists.sourceforge.net/lists/listinfo/nagios-users</a><br>
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.<br>
::: Messages without supporting info will risk being sent to /dev/null<br>
</blockquote></div><br><br clear="all"><br>-- <br>Cordially,<br>Shadhin Rahman<br>