problems with distributed monitoring
Dmitriy Kirhlarov
dkirhlarov at oilspace.com
Thu Feb 3 17:00:00 CET 2005
Hi!
We are trying to implement distributed monitoring scheme with snmp and passive checks, in general.
For transfering passive checks results from a client to "monitoring" nagios we use nsca without crypting (BTW, nsca-2.4 can't be compiled without libmcrypt. We had be write a patch)
Data are transfered every min.
"Monitoring" nagios has oscp_command set to script, which puts data to "to_send.dump" file. Every 30 sec other script transfers accumulated data from the "to_send.dump" file to "central" nagios via nsca over ssh-forwarded port.
All work fine on the "monitoring" nagios. All services "OK".
But, on "central" nagios most part of services are in flapping and/or unknown status.
As I can see, those services was configured as active checks on "monitoring" nagios.
My templates:
on "monitoring" nagios:
-----
define service{
name generic-service
register 0
is_volatile 0
max_check_attempts 1
normal_check_interval 1
retry_check_interval 1
check_period 24x7
parallelize_check 1
obsess_over_service 1
check_freshness 1
event_handler_enabled 1
flap_detection_enabled 1
process_perf_data 1
retain_status_information 1
retain_nonstatus_information 1
notification_interval 120
notification_period 24x7
notification_options w,u,c,r
notifications_enabled 0
contact_groups admins
}
define service{
use generic-service
register 0
name minfra01-active
freshness_threshold 180
passive_checks_enabled 1
active_checks_enabled 1
}
define service{
use generic-service
register 0
name minfra01-passive
freshness_threshold 180
passive_checks_enabled 1
active_checks_enabled 0
}
-----
On "central" nagios:
----
define service{
name generic-service
register 0
is_volatile 0
max_check_attempts 3
normal_check_interval 4
retry_check_interval 2
check_period 24x7
parallelize_check 1
obsess_over_service 1
check_freshness 1
event_handler_enabled 1
flap_detection_enabled 1
process_perf_data 1
retain_status_information 1
retain_nonstatus_information 1
notification_interval 120
notification_period 24x7
notification_options w,u,c,r
notifications_enabled 0
contact_groups admins
}
define service{
use generic-service
register 0
name minfra01-active
passive_checks_enabled 1
active_checks_enabled 0
}
define service{
use generic-service
register 0
name minfra01-passive
freshness_threshold 600
passive_checks_enabled 1
active_checks_enabled 0
}
----
WBR
--
Dmitriy Kirhlarov
OILspace, 4, B. Spasskaya str. room 01, 107978 Moscow, Russia
P:+7 095 787 4700 F:+7 095 787 4701 E:DmitriyKirhlarov at oilspace.com
OILspace - The resource enriched - www.oilspace.com
-------------------------------------------------------
This SF.Net email is sponsored by: IntelliVIEW -- Interactive Reporting
Tool for open source databases. Create drag-&-drop reports. Save time
by over 75%! Publish reports on the web. Export to DOC, XLS, RTF, etc.
Download a FREE copy at http://www.intelliview.com/go/osdn_nl
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list