<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=us-ascii">
<META content="MSHTML 6.00.6000.16525" name=GENERATOR></HEAD>
<BODY>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>hi:</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>I have a nagios 2.9
instance running on an ESX linux guest. The problem we are seeing is that
whenever we lose and regain network connectivity to the host, nagios
wrongly sends a bunch of server down and server up alerts for all the
servers that nagios is monitoring. </FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>this is how my hosts.cfg
looks like for a typical hosts</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>define
host{<BR>
name
generic-host ; Generic template
name<BR>
notifications_enabled
1
; Host notifications are enabled<BR>
event_handler_enabled
1
; Host event handler is enabled<BR>
flap_detection_enabled
1
; Flap detection is enabled<BR>
process_perf_data
1
; Process performance data<BR>
retain_status_information
1
; Retain status information<BR>
retain_nonstatus_information
1
; Retain non-status information<BR>
register
0
; DONT REGISTER THIS DEFINITION<BR>
}</FONT></SPAN></DIV>
<DIV> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond># This creates a generic
host that your routers can use<BR># monitors host(s) 24x7, notifies on down and
recovery, checks 15 times before going critical,<BR># notifies the contact_group
every 30 minutes<BR>define host{<BR>
name
basic-host<BR>
use
generic-host<BR>
check_command
check-host-alive<BR>
max_check_attempts
10<BR>
notification_interval
30<BR>
notification_period
24x7<BR>
notification_options
d,r<BR>
register
0<BR> }</FONT></SPAN></DIV>
<DIV> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>#adelphi<BR>define
host{<BR>
use
basic-host<BR>
host_name
adelphi<BR>
alias
adelphi<BR>
address
172.xx.xx.xx (intentional)</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond>
contact_groups
rpfl-it<BR> }</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond></FONT></SPAN> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>this is how my
services.cfg file looks like</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond>-----</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>define
service{<BR>
name
generic-service ; Generic service
name<BR>
active_checks_enabled
1
; Active service checks are
enabled<BR>
passive_checks_enabled
1
; Passive service checks are
enabled/accepted<BR>
parallelize_check
1
; Active service checks should be
parallelized<BR>
obsess_over_service
1
; We should obsess over this
service<BR>
check_freshness
0
; Default is to NOT check service
'freshness'<BR>
notifications_enabled
1
; Service notifications are
enabled<BR>
event_handler_enabled
1
; Service event handler is enabled<BR>
flap_detection_enabled
1
; Flap detection is enabled<BR>
process_perf_data
1
; Process performance data<BR>
retain_status_information
1
; Retain status information<BR>
retain_nonstatus_information
1
; Retain non-status information<BR>
register
0
; DONT REGISTER THIS DEFINITION<BR>
}</FONT></SPAN></DIV>
<DIV> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>define
service{<BR>
use
generic-service<BR>
name
basic-service<BR>
is_volatile
0<BR>
check_period
24x7<BR>
max_check_attempts
15<BR>
normal_check_interval
10<BR>
retry_check_interval
2<BR>
notification_interval
0<BR>
notification_period
none<BR>
register
0<BR> }</FONT></SPAN></DIV>
<DIV> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond># Generic for all
services<BR># PING - ensure HOSTS are available.<BR>define
service{<BR>
use
basic-service<BR>
name
ping-service<BR>
service_description
PING<BR>
notification_interval
30<BR>
contact_groups
rpfl-it</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond>
hostgroup_name
PROD1<BR>
notification_options
c,r<BR>
notification_period
24x7<BR>
check_command
check_ping!1000.0,20%!2000.0,60%<BR>
}</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond>-----</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond></FONT></SPAN> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>the question I have is
why would nagios send DOWN/UP alerts for all the hosts it is monitoring when it
is just the host that it is on loses connectivity.</FONT></SPAN></DIV>
<DIV><SPAN class=846550020-05122007><FONT
face=Garamond></FONT></SPAN> </DIV>
<DIV><SPAN class=846550020-05122007><FONT face=Garamond>thanks in
advance</FONT></SPAN></DIV><!-- Converted from text/rtf format -->
<P><SPAN lang=en-us><FONT face=Garamond>-George</FONT></SPAN> </P>
<DIV><FONT face=Garamond></FONT> </DIV></BODY></HTML>