Dear fellow Nagios users,<br><br>We have a number of servers that share filesystems. The filesystems actually reside on SAN storage and are attached to the physical servers. Let's say /home is shared between server1 and server2. When /home breaches the WARNING or CRITICAL threshold, two tickets are raised, one from each server.<br>
<br>I tried using the check_cluster and check_cluster2 plugins so that only one ticket is raised when there is a problem, instead of two.<br><br>I have the following definitions...<br><br>"Disk Monitor /home" is monitored on server1 and server2 via NRPE, with notifications disabled.<br>
<br>hostgroups<br>==========<br><br>define hostgroup {<br> hostgroup_name clustered_servers<br> alias Clustered servers<br> members server1,server2<br>
}<br><br><br><br>servicecluster.cfg<br>==================<br>define service {<br> service_description Clustered disk /home<br> hostgroup_name clustered_servers<br> check_command check_cluster_service2!$SERVICESTATEID:server1:Disk Monitor /home$,$SERVICESTATEID:server2:Disk Monitor /home$<br>
max_check_attempts 3<br> check_interval 2<br> retry_interval 1<br> notification_interval 15<br> contacts sysadms<br>
register 1<br> }<br><br><br>It seems that when hostgroup_name is expanded internally, Nagios creates this service check on both servers, so two alerts are to be expected. What changes can I make so that only one ticket is raised? In some cases we have around 10 servers sharing filesystems, so you can imagine the havoc this creates.<br>
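<br>To illustrate what I believe is happening (this is only a sketch of my assumption, not actual Nagios output), the definition above appears to be treated as if I had written one copy of the service per member of the hostgroup:<br><br>assumed effective result<br>========================<br># copy created for the first hostgroup member<br>define service {<br> host_name server1<br> service_description Clustered disk /home<br> # check_command and remaining directives same as in servicecluster.cfg<br> }<br><br># copy created for the second hostgroup member<br>define service {<br> host_name server2<br> service_description Clustered disk /home<br> # check_command and remaining directives same as in servicecluster.cfg<br> }<br><br>If that is right, each copy evaluates the same cluster check and notifies on its own, which would explain the duplicate tickets.<br>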
<br>Please let me know if you have any questions.<br><br>Thanks,<br>Ashish<br>