Unneeded alerts from Nagios

Monappallil, George george.monappallil at rpfl.com
Wed Dec 5 21:16:36 CET 2007
Hi,
I have a Nagios 2.9 instance running on an ESX Linux guest. The problem
we are seeing is that whenever we lose and regain network connectivity
to the Nagios host, Nagios wrongly sends a bunch of server down and
server up alerts for all the servers it is monitoring.
This is what my hosts.cfg looks like for a typical host:
define host{
        name                            generic-host    ; Generic template name
        notifications_enabled           1               ; Host notifications are enabled
        event_handler_enabled           1               ; Host event handler is enabled
        flap_detection_enabled          1               ; Flap detection is enabled
        process_perf_data               1               ; Process performance data
        retain_status_information       1               ; Retain status information
        retain_nonstatus_information    1               ; Retain non-status information
        register                        0               ; DONT REGISTER THIS DEFINITION
        }
 
# This creates a generic host that your routers can use
# monitors host(s) 24x7, notifies on down and recovery, checks 15 times before going critical,
# notifies the contact_group every 30 minutes
define host{
        name                    basic-host
        use                     generic-host
        check_command           check-host-alive
        max_check_attempts      10
        notification_interval   30
        notification_period     24x7
        notification_options    d,r
        register                0
        }
 
#adelphi
define host{
        use                     basic-host
        host_name               adelphi
        alias                   adelphi
        address                 172.xx.xx.xx (intentional)
        contact_groups          rpfl-it
        }
 
This is what my services.cfg file looks like:
-----
define service{
        name                            generic-service ; Generic service name
        active_checks_enabled           1               ; Active service checks are enabled
        passive_checks_enabled          1               ; Passive service checks are enabled/accepted
        parallelize_check               1               ; Active service checks should be parallelized
        obsess_over_service             1               ; We should obsess over this service
        check_freshness                 0               ; Default is to NOT check service 'freshness'
        notifications_enabled           1               ; Service notifications are enabled
        event_handler_enabled           1               ; Service event handler is enabled
        flap_detection_enabled          1               ; Flap detection is enabled
        process_perf_data               1               ; Process performance data
        retain_status_information       1               ; Retain status information
        retain_nonstatus_information    1               ; Retain non-status information
        register                        0               ; DONT REGISTER THIS DEFINITION
        }
 
define service{
        use                             generic-service
        name                            basic-service
        is_volatile                     0
        check_period                    24x7
        max_check_attempts              15
        normal_check_interval           10
        retry_check_interval            2
        notification_interval           0
        notification_period             none
        register                        0
        }
 
# Generic for all services
# PING - ensure HOSTS are available.
define service{
        use                             basic-service
        name                            ping-service
        service_description             PING
        notification_interval           30
        contact_groups                  rpfl-it
        hostgroup_name                  PROD1
        notification_options            c,r
        notification_period             24x7
        check_command                   check_ping!1000.0,20%!2000.0,60%
        }
-----
 
My question is: why would Nagios send DOWN/UP alerts for all the
hosts it is monitoring when only the host that Nagios itself runs on
loses connectivity?
 
Thanks in advance.

-George 

 