Unneeded alerts from Nagios
Monappallil, George
george.monappallil at rpfl.com
Wed Dec 5 21:16:36 CET 2007
hi:
I have a nagios 2.9 instance running on an ESX linux guest. The problem
we are seeing is that whenever we lose and regain network connectivity
to the host, nagios wrongly sends a bunch of server down and server up
alerts for all the servers that nagios is monitoring.
this is how my hosts.cfg looks like for a typical hosts
define host{
name generic-host ; Generic
template name
notifications_enabled 1 ; Host
notifications are enabled
event_handler_enabled 1 ; Host event
handler is enabled
flap_detection_enabled 1 ; Flap detection
is enabled
process_perf_data 1 ; Process
performance data
retain_status_information 1 ; Retain status
information
retain_nonstatus_information 1 ; Retain
non-status information
register 0 ; DONT REGISTER
THIS DEFINITION
}
# This creates a generic host that your routers can use
# monitors host(s) 24x7, notifies on down and recovery, checks 15 times
before going critical,
# notifies the contact_group every 30 minutes
define host{
name basic-host
use generic-host
check_command check-host-alive
max_check_attempts 10
notification_interval 30
notification_period 24x7
notification_options d,r
register 0
}
#adelphi
define host{
use basic-host
host_name adelphi
alias adelphi
address 172.xx.xx.xx (intentional)
contact_groups rpfl-it
}
this is how my services.cfg file looks like
-----
define service{
name generic-service ; Generic
service name
active_checks_enabled 1 ; Active service
checks are enabled
passive_checks_enabled 1 ; Passive
service checks are enabled/accepted
parallelize_check 1 ; Active service
checks should be parallelized
obsess_over_service 1 ; We should
obsess over this service
check_freshness 0 ; Default is to
NOT check service 'freshness'
notifications_enabled 1 ; Service
notifications are enabled
event_handler_enabled 1 ; Service event
handler is enabled
flap_detection_enabled 1 ; Flap detection
is enabled
process_perf_data 1 ; Process
performance data
retain_status_information 1 ; Retain status
information
retain_nonstatus_information 1 ; Retain
non-status information
register 0 ; DONT REGISTER
THIS DEFINITION
}
define service{
use generic-service
name basic-service
is_volatile 0
check_period 24x7
max_check_attempts 15
normal_check_interval 10
retry_check_interval 2
notification_interval 0
notification_period none
register 0
}
# Generic for all services
# PING - ensure HOSTS are available.
define service{
use basic-service
name ping-service
service_description PING
notification_interval 30
contact_groups rpfl-it
hostgroup_name PROD1
notification_options c,r
notification_period 24x7
check_command check_ping!1000.0,20%!2000.0,60%
}
-----
the question I have is why would nagios send DOWN/UP alerts for all the
hosts it is monitoring when it is just the host that it is on loses
connectivity.
thanks in advance
-George
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20071205/c330ddb7/attachment.html>
-------------- next part --------------
-------------------------------------------------------------------------
SF.Net email is sponsored by: The Future of Linux Business White Paper
from Novell. From the desktop to the data center, Linux is going
mainstream. Let it simplify your IT future.
http://altfarm.mediaplex.com/ad/ck/8857-50307-18918-4
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list