performance problems
Javier Castillo Alcibar
Javier.Castillo at alhambra-eidos.com
Wed Jul 6 12:05:21 CEST 2005
Hello Hendrik,
The machine is a DL380 with 2 x 2.8 GHz CPUs and 2 GB RAM. I think it has enough hardware resources. The details:
alhmngsrv2:/etc/nagios# vmstat 5
procs -----------memory---------- ---swap-- -----io---- --system-- ----cpu----
 r  b   swpd   free   buff  cache   si   so    bi    bo   in    cs us sy id wa
 0  0      4 215764 222008 594536    0    0     0     0    0     1  6  4 88  2
 1  0      4 211028 222008 594536    0    0     0   246 1170     0  4  1 93  2
 1  0      4 202228 222008 594536    0    0     0   314 1182     0 13  2 83  3
 0  0      4 205300 222008 594536    0    0     0   126 1167     0  8  1 90  1
18  0      4 209660 222008 594536    0    0     0   275 1207     0  1  1 94  4
 0  0      4 209596 222008 594536    0    0     0   229 1143     0  2  1 96  2
 0  0      4 202428 222012 594532    0    0     0    77 1117     0  5  1 94  0
 2  0      4 186940 222016 594528    0    0     0   110 1145     0 19  2 80  0
 2  0      4 190076 222016 594528    0    0     0   245 1127     0  1  1 96  2
17  0      4 191020 222016 594528    0    0     0   150 1128     0  2  1 96  2
 2  0      4 198316 222016 594460    0    0     0   286 1138     0  3  1 94  2
 0  0      4 197004 222020 594456    0    0     0   168 1125     0  2  1 96  2
17  0      4 196436 222028 594448    0    0     0   396 1210     0  2  1 93  3
 9  0      4 205908 222028 594448    0    0     0   393 1244     0  3  1 92  3
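As a rough aid for reading output like this, here is a minimal Python sketch that parses vmstat rows and summarizes idle CPU, I/O wait, swap traffic and run-queue spikes. The function names and the run_queue_limit threshold are my own assumptions for illustration, not anything vmstat or Nagios defines. The sample rows are copied from the output above.

```python
# Column names copied from the vmstat header in this message.
COLUMNS = "r b swpd free buff cache si so bi bo in cs us sy id wa".split()

def parse_vmstat(text):
    """Turn vmstat data rows into a list of dicts keyed by column name."""
    rows = []
    for line in text.splitlines():
        fields = line.split()
        # Skip header lines and anything that is not a full numeric row.
        if len(fields) != len(COLUMNS) or not all(f.isdigit() for f in fields):
            continue
        rows.append(dict(zip(COLUMNS, map(int, fields))))
    return rows

def summarize(rows, run_queue_limit=8):
    """Average idle/wait CPU, peak swap traffic, and run-queue spikes.

    run_queue_limit is an arbitrary illustrative threshold.
    """
    if not rows:
        raise ValueError("no vmstat data rows found")
    n = len(rows)
    return {
        "avg_id": sum(r["id"] for r in rows) / n,
        "avg_wa": sum(r["wa"] for r in rows) / n,
        "max_swap_io": max(r["si"] + r["so"] for r in rows),
        "busy_samples": [r["r"] for r in rows if r["r"] > run_queue_limit],
    }

# Four of the samples shown above.
SAMPLE = """\
r b swpd free buff cache si so bi bo in cs us sy id wa
0 0 4 205300 222008 594536 0 0 0 126 1167 0 8 1 90 1
18 0 4 209660 222008 594536 0 0 0 275 1207 0 1 1 94 4
0 0 4 209596 222008 594536 0 0 0 229 1143 0 2 1 96 2
17 0 4 191020 222016 594528 0 0 0 150 1128 0 2 1 96 2
"""

if __name__ == "__main__":
    print(summarize(parse_vmstat(SAMPLE)))
```

On these samples the CPUs are mostly idle and there is no swap traffic, while the run queue briefly jumps to 17 or 18 processes.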
It's very strange, because when I start Nagios everything goes well: plugin execution times and latencies are OK. But two or three minutes later, Nagios starts reporting strange time values, for example:
1120644060 FIREWALL PING OK 1 HARD 9 157 FPING OK - 192.168.0.1 (loss=0%, rta=72.400000 ms) loss=0%;10;30;0;100 rta=0.072400s;2.000000;5.000000;0.000000
Execution time = 9 and latency = 157?? Why? This test had no packet loss and the machine is working fine, but...
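To compare these values across many checks, a line like the one above can be split into named fields. This is only a sketch: the field order (timestamp, host, service, state, attempt, state type, execution time, latency, plugin output) is inferred from the sample line and from the values quoted here, not from a documented format, and it assumes host and service names contain no spaces.

```python
from collections import namedtuple

# Field order is inferred from the sample line in this message; it is an
# assumption, not a documented Nagios format.
CheckResult = namedtuple(
    "CheckResult",
    "timestamp host service state attempt state_type "
    "execution_time latency output",
)

def parse_check_line(line):
    """Split one whitespace-separated result line into named fields.

    Assumes host and service names contain no spaces.
    """
    parts = line.split(None, 8)
    if len(parts) < 9:
        raise ValueError("unexpected line format: %r" % line)
    ts, host, service, state, attempt, state_type, exec_time, latency, output = parts
    return CheckResult(int(ts), host, service, state, int(attempt),
                       state_type, int(exec_time), int(latency), output)

line = ("1120644060 FIREWALL PING OK 1 HARD 9 157 FPING OK - 192.168.0.1 "
        "(loss=0%, rta=72.400000 ms) loss=0%;10;30;0;100 "
        "rta=0.072400s;2.000000;5.000000;0.000000")
result = parse_check_line(line)
print(result.host, result.service, result.execution_time, result.latency)
```

For this line the plugin itself reports a round-trip time of 72.4 ms, while the recorded execution time is 9 seconds and the latency 157 seconds.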
Another strange thing I noticed: if I have "max_concurrent_checks=120", why is Nagios so often running only 1, 2, 3 or 4 plugins at the same time?
Honestly, I'm very confused.
Thx in advance,
Javier.
-----Original Message-----
From: Hendrik Baecker [mailto:b00mer at gmx.net]
Sent: Wednesday, 6 July 2005 10:49
To: Javier Castillo Alcibar
CC: nagios-users at lists.sourceforge.net
Subject: Re: [Nagios-users] performance problems
Hello Javier,
Try to figure out where your service latencies come from.
One hint: look at this setting in nagios.cfg (it must be set to 1 for performance data to be processed):
process_performance_data=0
http://nagios.sourceforge.net/docs/1_0/perfdata.html
From the performance data you may be able to see where your latencies come from.
Another approach is to run 'top' and look at swap usage, the load, and perhaps the 'wa' (I/O wait) CPU figure.
It is not good to have a server swapping or under high load, and it is very bad if your server is waiting for I/O...
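The 'wa' figure can also be computed directly on Linux from two samples of the aggregate "cpu" line in /proc/stat, where iowait is the fifth counter (user, nice, system, idle, iowait, ...). The following is a minimal sketch under that assumption; the function names and the 5-second default interval are illustrative choices of mine.

```python
import time

def cpu_times(stat_line):
    """Parse the aggregate 'cpu' line of /proc/stat into a list of ints."""
    fields = stat_line.split()
    if not fields or fields[0] != "cpu":
        raise ValueError("expected the aggregate 'cpu' line")
    return [int(v) for v in fields[1:]]

def iowait_percent(before, after):
    """Percentage of CPU time spent in iowait between two samples.

    Only the first eight counters are summed (user nice system idle
    iowait irq softirq steal); later guest counters are already
    included in user/nice.
    """
    a, b = cpu_times(before)[:8], cpu_times(after)[:8]
    deltas = [y - x for x, y in zip(a, b)]
    if len(deltas) < 5:
        raise ValueError("no iowait column in this /proc/stat line")
    total = sum(deltas)
    return 100.0 * deltas[4] / total if total else 0.0

def sample_iowait(interval=5.0):
    """Take two live samples from /proc/stat (Linux only)."""
    def read():
        with open("/proc/stat") as f:
            return f.readline()
    first = read()
    time.sleep(interval)
    return iowait_percent(first, read())
```

Calling sample_iowait() on the Nagios host while the latency is high gives a number comparable to the 'wa' column of vmstat or top.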
Javier Castillo Alcibar wrote:
>Hi everybody!!
>
>I have a strange performance problem with my Nagios 1.3 server (Debian
>sarge). It runs checks very slowly, as I can see in extinfo.cgi.
>How can I debug this? Any hint/trick? Any parameter?
>
>Program-Wide Performance Information
>
>Active Checks:
>Time Frame            Checks Completed
><= 1 minute:          95 (28.1%)
><= 5 minutes:         283 (83.7%)
><= 15 minutes:        338 (100.0%)
><= 1 hour:            338 (100.0%)
>Since program start:  338 (100.0%)
>
>
>Metric                Min.      Max.      Average
>Check Execution Time: < 1 sec   24 sec    7.201 sec
>Check Latency:        99 sec    135 sec   119.867 sec
>Percent State Change: 0.00%     0.00%     0.00%
>
>
>Passive Checks:
>Time Frame            Checks Completed
><= 1 minute:          0 (0.0%)
><= 5 minutes:         0 (0.0%)
><= 15 minutes:        0 (0.0%)
><= 1 hour:            0 (0.0%)
>Since program start:  0 (0.0%)
>
>
>Metric                Min.      Max.      Average
>Percent State Change: 0.00%     0.00%     0.00%
>
>#
>log_file=/var/log/nagios/nagios.log
>cfg_file=/etc/nagios/checkcommands.cfg
>cfg_file=/etc/nagios/misccommands.cfg
>cfg_file=/etc/nagios/contactgroups.cfg
>cfg_file=/etc/nagios/contacts.cfg
>cfg_file=/etc/nagios/dependencies.cfg
>cfg_file=/etc/nagios/escalations.cfg
>cfg_file=/etc/nagios/hostgroups.cfg
>cfg_file=/etc/nagios/hosts.cfg
>cfg_file=/etc/nagios/services.cfg
>cfg_file=/etc/nagios/timeperiods.cfg
>resource_file=/etc/nagios/resource.cfg
>temp_file=/var/log/nagios/nagios.tmp
>status_file=/var/log/nagios/status.log
>aggregate_status_updates=1
>status_update_interval=15
>nagios_user=nagios
>nagios_group=nagios
>enable_notifications=1
>execute_service_checks=1
>accept_passive_service_checks=1
>enable_event_handlers=1
>log_rotation_method=d
>log_archive_path=/var/log/nagios/archives
>check_external_commands=1
>command_check_interval=-1
>comment_file=/var/log/nagios/comment.log
>downtime_file=/var/log/nagios/downtime.log
>command_file=/var/log/nagios/rw/nagios.cmd
>lock_file=/var/log/nagios/nagios.lock
>retain_state_information=1
>state_retention_file=/var/log/nagios/status.sav
>retention_update_interval=60
>use_retained_program_state=0
>use_syslog=1
>log_notifications=1
>log_service_retries=1
>log_host_retries=1
>log_event_handlers=1
>log_initial_states=0
>log_external_commands=1
>log_passive_service_checks=1
>sleep_time=1
>max_concurrent_checks=120
>#service_reaper_frequency=10
>service_reaper_frequency=5
>interval_length=60
>use_agressive_host_checking=0
>enable_flap_detection=0
>low_service_flap_threshold=5.0
>high_service_flap_threshold=20.0
>low_host_flap_threshold=5.0
>high_host_flap_threshold=20.0
>soft_state_dependencies=0
>service_check_timeout=30
>host_check_timeout=30
>event_handler_timeout=60
>notification_timeout=60
>ocsp_timeout=5
>perfdata_timeout=5
>obsess_over_services=0
>process_performance_data=0
>check_for_orphaned_services=0
>check_service_freshness=1
>freshness_check_interval=60
>admin_email=nagios
>admin_pager=pagenagios
>date_format=euro
>
>
>
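When comparing a config like the one quoted above against the documentation, it can help to pull out just the directives that matter for check timing. Here is a small Python sketch for that; the list of "interesting" keys is my own selection from the posted file, and the parser only assumes plain key=value lines with # comments.

```python
from collections import defaultdict

def parse_main_config(text):
    """Parse 'key=value' lines; keys may repeat (e.g. cfg_file)."""
    settings = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        # Ignore blank lines, comments, and anything without '='.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        settings[key.strip()].append(value.strip())
    return settings

# Directives from the posted config worth reviewing when chasing latency
# (an illustrative selection, not an official list).
TUNING_KEYS = ["max_concurrent_checks", "service_reaper_frequency",
               "sleep_time", "interval_length", "service_check_timeout",
               "process_performance_data"]

def tuning_summary(settings):
    """Return the last value seen for each tuning-related directive."""
    return {k: settings[k][-1] for k in TUNING_KEYS if k in settings}

# A few lines copied from the config in this message.
SAMPLE = """\
cfg_file=/etc/nagios/hosts.cfg
cfg_file=/etc/nagios/services.cfg
sleep_time=1
max_concurrent_checks=120
#service_reaper_frequency=10
service_reaper_frequency=5
interval_length=60
service_check_timeout=30
process_performance_data=0
"""

if __name__ == "__main__":
    for key, value in tuning_summary(parse_main_config(SAMPLE)).items():
        print(key, "=", value)
```

To run it against the real file, read /etc/nagios/nagios.cfg into a string and pass it to parse_main_config().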
>-------------------------------------------------------
>SF.Net email is sponsored by: Discover Easy Linux Migration Strategies
>from IBM. Find simple to follow Roadmaps, straightforward articles,
>informative Webcasts and more! Get everything you need to get up to
>speed, fast. http://ads.osdn.com/?ad_idt77&alloc_id492&op=click
>_______________________________________________
>Nagios-users mailing list
>Nagios-users at lists.sourceforge.net
>https://lists.sourceforge.net/lists/listinfo/nagios-users
>::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
>::: Messages without supporting info will risk being sent to /dev/null