Buggy Nagios 2.x performance data?
Hugo van der Kooij
hvdkooij at vanderkooij.org
Fri Apr 7 19:08:41 CEST 2006
Hi,
I have built a new system whose primary task will be running Nagios. Nagios
seems to be running just fine, and I was adding NagiosGrapher v1.5 to it to
handle the performance data. But it seems I still have issues with
performance data going missing.
I first noticed this after I added extra servers; before that I was only
graphing just about anything I could check with Nagios on my local server.
So I checked with tcpdump to see whether the checks are done every 3 minutes
for a particular host. tcpdump shows me the right traffic for the checks
(PING + DNS) every 3 minutes:
08:05:20.144479 IP arwen.waakhond.net.32994 > ns2.mydyndns.org.domain: 20487+ A? www.vanderkooij.org. (37)
08:05:20.263711 IP ns2.mydyndns.org.domain > arwen.waakhond.net.32994: 20487*- 2/5/4 CNAME[|domain]
08:05:45.154599 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 0
08:05:45.274403 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 0
08:05:46.155946 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 1
08:05:46.271167 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 1
08:05:47.156796 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 2
08:05:47.273669 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 2
08:05:48.158648 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 3
08:05:48.278399 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 3
08:05:49.159490 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 4
08:05:49.274105 IP ns2.mydyndns.org > arwen.waakhond.net: icmp 64: echo reply seq 4
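A minimal sketch for confirming the check interval from a capture like the one above, rather than reading it off by eye. It assumes the tcpdump text format shown here and treats every "echo request seq 0" line as the start of one PING check. The second sample line, the 180 second target and the 30 second tolerance are made up for illustration.

```python
import re
from datetime import datetime

# Hypothetical excerpt: the first line is from the capture above, the second
# is an invented line three minutes later, for illustration only.
SAMPLE = """\
08:05:45.154599 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 0
08:08:45.160112 IP arwen.waakhond.net > ns2.mydyndns.org: icmp 64: echo request seq 0
"""

# The first echo request of each check_ping run carries sequence number 0.
FIRST_PING = re.compile(r"^(\d{2}:\d{2}:\d{2}\.\d{6}) .*echo request seq 0$")


def check_starts(capture):
    """Return the timestamp of every 'echo request seq 0' line."""
    starts = []
    for line in capture.splitlines():
        match = FIRST_PING.match(line.strip())
        if match:
            starts.append(datetime.strptime(match.group(1), "%H:%M:%S.%f"))
    return starts


def gaps_seconds(starts):
    """Seconds between consecutive check starts (same-day capture assumed)."""
    return [(b - a).total_seconds() for a, b in zip(starts, starts[1:])]


def late_checks(gaps, expected=180.0, tolerance=30.0):
    """Return the gaps that differ from the expected interval by too much."""
    return [g for g in gaps if abs(g - expected) > tolerance]


if __name__ == "__main__":
    gaps = gaps_seconds(check_starts(SAMPLE))
    print("gaps between checks (s):", gaps)
    print("gaps outside tolerance:", late_checks(gaps))
```

An empty second list means every check started within the tolerance of the expected interval, which would point away from scheduling as the cause.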
The ngraph.log file shows intermittent behaviour consistent with the data in
the graphs.
This behaviour is restricted to the servers I added after the server
itself. All the checks on the local machine work and are logged
consistently in the graphs.
If I babysit the machine I can see the servers/services get updated
regularly, like clockwork, in the Nagios web pages. There is in fact new
performance data every 3 minutes on the pages.
At this point I think we can rule out the plugins, as Nagios gets the
proper data, but somehow the data does not seem to make it all the way.
I would appreciate some help in narrowing this one down and, if necessary,
providing the coders with the right data so it can be solved.
Just for good measure, here are a lot of details:
# uname -a
Linux arwen.waakhond.net 2.6.9-34.EL.XOS.1 #1 Sat Mar 11 06:39:32 CET 2006 i686 athlon i386 GNU/Linux
# rpm -qa|grep nagios
nagios-plugins-1.4.2-1.2.el4.rf
nagios-2.1-1.el4.rf
NagiosGrapher-1.5-1
define host {
name host-template
register 0
check_command check-host-alive
max_check_attempts 3
active_checks_enabled 1
# passive_checks_enabled 1
check_period 24x7
retain_status_information 1
retain_nonstatus_information 1
notification_interval 60
notification_period 24x7
notification_options d,u,r,f
notifications_enabled 1
}
define host {
use host-template
host_name arwen
alias arwen.waakhond.net
address 192.0.2.101
hostgroups waakhond-net
contact_groups waakhond-net
}
define service {
name service-template
register 0
max_check_attempts 3
normal_check_interval 1
retry_check_interval 1
active_checks_enabled 1
passive_checks_enabled 1
check_period 24x7
retain_status_information 1
retain_nonstatus_information 1
notification_interval 60
notification_period 24x7
notification_options w,u,c,r,f
notifications_enabled 1
}
define service {
use service-template
host_name arwen
service_description PING
check_command check_ping!10,20%!20,40%
contact_groups waakhond-net
}
# 'check_ping' command definition
define command{
command_name check_ping
command_line $USER1$/check_ping -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -p 5
}
define host {
name mydyndns-template
register 0
check_command check-host-alive
max_check_attempts 1
active_checks_enabled 1
# passive_checks_enabled 1
check_period 24x7
retain_status_information 1
retain_nonstatus_information 1
notification_interval 60
notification_period 24x7
notification_options d,u,r,f
notifications_enabled 1
}
define host{
use mydyndns-template
host_name ns2.mydyndns.org
alias DNS server 2
address 204.13.249.82
hostgroups mydyndns-org
contact_groups mydyndns-org
}
define service{
use mydyndns-template
host_name ns2.mydyndns.org
service_description DNS
check_command check_dig!1!5!www.vanderkooij.org
contact_groups mydyndns-org
}
define service{
use mydyndns-template
host_name ns2.mydyndns.org
service_description PING
check_command check_ping!200,20%!500,60%
contact_groups mydyndns-org
}
define command{
command_name check_dig
command_line $USER1$/check_dig -H $HOSTADDRESS$ -w $ARG1$ -c $ARG2$ -l $ARG3$
}
# ---
# Nagios Grapher Main Configuration
# ---
define config {
pipe /var/spool/nagios/ngraph.pipe
buffer 1024
interface pipe
step 300
heartbeat 600
rrdpath /var/spool/nagios/rrd/
tmppath /tmp/nagiosgrapher/
serviceext_type MULTIPLE
serviceext_path /etc/nagios/service-extinfo
url /nagios/cgi-bin/graphs.cgi
nagios_config /etc/nagios/nagios.cfg
cgi_config /etc/nagios/cgi.cfg
icon_image_tag dot.gif' border="0"></a><A TARGET="_blank" HREF="graphs.cgi?###URL###"><img src='###IMAGESRC###'
icon_image_src /nagios/images/logos/graph.png
log_file /var/log/nagios/ngraph.log
log_level 63
}
define ngraph{
service_name PING
graph_log_regex loss = ([0-9]+)
graph_value Loss
graph_units %
graph_legend Packet Lost
rrd_plottype LINE2
rrd_color ff0000
page Packet Lost
}
define ngraph{
service_name PING
graph_log_regex rta = ([0-9]+\.+[0-9]*) ms
graph_value RTA
graph_units ms
graph_legend Time to answer
rrd_plottype AREA
rrd_color 00A348
page RTA
}
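A rough sketch for testing the two graph_log_regex patterns above outside NagiosGrapher. It is written in Python, whose regex syntax agrees with Perl for these simple patterns. The sample plugin output line is hypothetical, and whether NagiosGrapher matches case-insensitively is an assumption here, so the helper lets you try both.

```python
import re

# Hypothetical check_ping status line; the real text depends on the plugin
# version and is not taken from the setup described above.
SAMPLE_OUTPUT = "PING OK - Packet loss = 0%, RTA = 119.50 ms"

# Patterns copied from the two ngraph blocks above.
PATTERNS = {
    "Loss": r"loss = ([0-9]+)",
    "RTA": r"rta = ([0-9]+\.+[0-9]*) ms",
}


def extract(output, ignore_case=True):
    """Apply each pattern to one line of plugin output.

    Returns a dict mapping the graph value name to the captured number,
    or None when the pattern does not match. ignore_case is an assumption
    about how the patterns are applied, not documented behaviour.
    """
    flags = re.IGNORECASE if ignore_case else 0
    values = {}
    for name, pattern in PATTERNS.items():
        match = re.search(pattern, output, flags)
        values[name] = float(match.group(1)) if match else None
    return values


if __name__ == "__main__":
    print("case-insensitive:", extract(SAMPLE_OUTPUT))
    print("case-sensitive:  ", extract(SAMPLE_OUTPUT, ignore_case=False))
```

Feeding it real status lines copied from the Nagios web pages for a host with missing graph data would show whether a pattern fails on some outputs. Note that the RTA pattern requires at least one dot, so a value printed without decimals would not match.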
--
I hate duplicates. Just reply to the relevant mailinglist.
hvdkooij at vanderkooij.org http://hvdkooij.xs4all.nl/
Don't meddle in the affairs of magicians,
for they are subtle and quick to anger.