Service Checks in Distributed mode
Ian Marks
imarks at comcast.net
Mon Aug 21 19:41:46 CEST 2006
I have 2 nagios 2.5 servers running; both are doing active checks, but
one acts as the "central" server and receives passive checks from the
other. I have it set up this way so our analysts will only have to
monitor "one" server. I am seeing major delays with service checks on
the central server. I am assuming my server is getting backed up trying
to process all the passive checks, so the active checks are only being
executed every 30-45 minutes. Is this possible? What would be the best
way to solve this problem, removing the passive checks?
Here are the stats:
##Central server##
Program Running Time: 0d 4h 31m 26s
Total Services: 812
Services Checked: 812
Services Scheduled: 439
Active Service Checks: 439
Passive Service Checks: 373
Total Service State Change: 0.000 / 34.210 / 0.121 %
Active Service Latency: 449.349 / 1996.148 / 1329.618 %
Active Service Execution Time: 0.049 / 60.556 / 7.779 sec
Active Service State Change: 0.000 / 11.580 / 0.065 %
Active Services Last 1/5/15/60 min: 0 / 0 / 427 / 439
Passive Service State Change: 0.000 / 34.210 / 0.188 %
Passive Services Last 1/5/15/60 min: 0 / 0 / 94 / 367
Services Ok/Warn/Unk/Crit: 720 / 8 / 18 / 66
Services Flapping: 0
Services In Downtime: 0
Total Hosts: 243
Hosts Checked: 243
Hosts Scheduled: 0
Active Host Checks: 178
Passive Host Checks: 65
Total Host State Change: 0.000 / 27.760 / 0.149 %
Active Host Latency: 0.000 / 2751.689 / 25.196 %
Active Host Execution Time: 0.016 / 10.015 / 1.171 sec
Active Host State Change: 0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min: 2 / 5 / 17 / 19
Passive Host State Change: 0.000 / 27.760 / 0.557 %
Passive Hosts Last 1/5/15/60 min: 1 / 32 / 62 / 63
Hosts Up/Down/Unreach: 212 / 31 / 0
Hosts Flapping: 0
Hosts In Downtime: 0
##Server Submitting Passive Checks##
Program Running Time: 3d 20h 24m 19s
Total Services: 367
Services Checked: 367
Services Scheduled: 367
Active Service Checks: 367
Passive Service Checks: 0
Total Service State Change: 0.000 / 6.250 / 0.017 %
Active Service Latency: 233.540 / 349.122 / 284.721 %
Active Service Execution Time: 0.023 / 34.478 / 4.352 sec
Active Service State Change: 0.000 / 6.250 / 0.017 %
Active Services Last 1/5/15/60 min: 0 / 151 / 367 / 367
Passive Service State Change: 0.000 / 0.000 / 0.000 %
Passive Services Last 1/5/15/60 min: 0 / 0 / 0 / 0
Services Ok/Warn/Unk/Crit: 329 / 6 / 18 / 14
Services Flapping: 0
Services In Downtime: 0
Total Hosts: 63
Hosts Checked: 63
Hosts Scheduled: 0
Active Host Checks: 63
Passive Host Checks: 0
Total Host State Change: 0.000 / 0.000 / 0.000 %
Active Host Latency: 0.000 / 381.349 / 309.423 %
Active Host Execution Time: 0.110 / 18.916 / 3.671 sec
Active Host State Change: 0.000 / 0.000 / 0.000 %
Active Hosts Last 1/5/15/60 min: 0 / 35 / 63 / 63
Passive Host State Change: 0.000 / 0.000 / 0.000 %
Passive Hosts Last 1/5/15/60 min: 0 / 0 / 0 / 0
Hosts Up/Down/Unreach: 57 / 6 / 0
Hosts Flapping: 0
Hosts In Downtime: 0
Thanks,
Ian
-------------------------------------------------------------------------
Using Tomcat but need to do more? Need to support web services, security?
Get stuff done quickly with pre-integrated technology to make your job easier
Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list