check_rsync: (Service check did not exit properly)
Ryan Rawdon
ryan at u13.net
Tue Jun 8 01:39:44 CEST 2010
Hey everyone,
I recently installed Nagios on a new system and migrated all of my
configs and everything over. Everything is working fine, except for
check_rsync, which fails on all hosts with "(Service check did not exit
properly)."
I have tried disabling the embedded perl interpreter (even though it
worked fine in the last install), but that didn't appear to do anything
except change the output to "null".
Running the plugin by hand works 100% fine.
What else should I check? Here's the output with embedded perl (which
is one of the things that led me to believe it was an embedded perl issue)
[1275952035.062527] [016.0] [pid=2637] Attempting to run scheduled check
of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check
options=0, latency=0.062000
[1275952035.062585] [016.0] [pid=2637] Checking service 'RSync Server'
on host 'vm-mx01.puttynuts.com'...
[1275952035.062635] [2320.2] [pid=2637] Raw Command Input:
$USER1$/check_rsync -H $HOSTADDRESS$
[1275952035.062656] [2320.2] [pid=2637] Expanded Command Output:
$USER1$/check_rsync -H $HOSTADDRESS$
[1275952035.062782] [016.1] [pid=2637] Check result output will be
written to '/var/lib/nagios3/spool/checkresults/checkg48PqE' (fd=7)
[1275952035.062934] [016.1] [pid=2637] ** Using Embedded Perl
interpreter to run service check...
[1275952035.063208] [016.0] [pid=2637] Embedded Perl failed to compile
/usr/lib/nagios/plugins/check_rsync, compile error **ePN failed to
compile /usr/lib/nagios/plugins/chec$
BEGIN failed--compilation aborted at (eval 1) line 39." at
/usr/lib/nagios3/p1.pl line 161.
- skipping plugin
Here's the output with embedded perl turned up (and I've since cranked
up the logging level a bit):
[1275953233.045782] [016.0] [pid=25338] Attempting to run scheduled
check of service 'RSync Server' on host 'vm-mx01.puttynuts.com': check
options=0, latency=0.045000
[1275953233.045923] [016.0] [pid=25338] Checking service 'RSync Server'
on host 'vm-mx01.puttynuts.com'...
[1275953233.046002] [2320.2] [pid=25338] Raw Command Input:
$USER1$/check_rsync -H $HOSTADDRESS$
[1275953233.046013] [2320.2] [pid=25338] Expanded Command Output:
$USER1$/check_rsync -H $HOSTADDRESS$
[1275953233.046183] [016.1] [pid=25338] Check result output will be
written to '/var/lib/nagios3/spool/checkresults/check6XvsV0' (fd=7)
[1275953233.047328] [016.2] [pid=25338] Service check is executing in
child process (pid=26948)
[1275953233.098575] [016.2] [pid=26949] Moving temp check result file
'/var/lib/nagios3/spool/checkresults/check6XvsV0' to queue file
'/var/lib/nagios3/spool/checkresults/cpt5vVV'...
and another excerpt:
[1275953511.122166] [016.1] [pid=25338] Handling check result for
service 'RSync Server' on host 'jester.u13.net'...
[1275953511.122173] [016.0] [pid=25338] ** Handling check result for
service 'RSync Server' on host 'jester.u13.net'...
[1275953511.122179] [016.1] [pid=25338] HOST: jester.u13.net, SERVICE:
RSync Server, CHECK TYPE: Active, OPTIONS: 0, SCHEDULED: Yes,
RESCHEDULE: Yes, EXITED OK: Yes, RETURN CODE: 2, OUTPUT: (null)
[1275953511.122209] [016.2] [pid=25338] Parsing check output...
[1275953511.122215] [016.2] [pid=25338] Short Output: (null)
[1275953511.122221] [016.2] [pid=25338] Long Output: NULL
[1275953511.122227] [016.2] [pid=25338] Perf Data: NULL
[1275953511.122233] [016.2] [pid=25338] ST: HARD CA: 3 MA: 3 CS: 2
LS: 2 LHS: 2
[1275953511.122240] [016.1] [pid=25338] Service is in a non-OK state!
[1275953511.122246] [016.1] [pid=25338] Host is currently UP, so we'll
recheck its state to make sure...
[1275953511.122252] [016.1] [pid=25338] * Using last known host state: 0
[1275953511.122261] [016.1] [pid=25338] Current/Max Attempt(s): 3/3
[1275953511.122267] [016.1] [pid=25338] Service has reached max number
of rechecks, so we'll handle the error...
[1275953511.122274] [016.1] [pid=25338] Checking service 'RSync Server'
on host 'jester.u13.net' for flapping...
[1275953511.122280] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00,
PSC=0.00%
[1275953511.122288] [016.1] [pid=25338] Service is not flapping (0.00%
state change).
[1275953511.122294] [016.1] [pid=25338] Checking host 'jester.u13.net'
for flapping...
[1275953511.122308] [016.2] [pid=25338] LFT=5.00, HFT=20.00, CPC=0.00,
PSC=0.00%
[1275953511.122316] [016.1] [pid=25338] Host is not flapping (0.00%
state change).
[1275953511.122360] [016.1] [pid=25338] Rescheduling next check of
service at Mon Jun 7 23:36:45 2010
[1275953511.122380] [016.0] [pid=25338] Scheduling a non-forced, active
check of service 'RSync Server' on host 'jester.u13.net' @ Mon Jun 7
23:36:45 2010
[1275953511.122389] [016.2] [pid=25338] Scheduling new service check event.
[1275953511.122409] [016.1] [pid=25338] Deleted check result file
'/var/lib/nagios3/spool/checkresults/cwgRtGi'
[1275953511.122416] [016.2] [pid=25338] Found a check result (#4) to
handle...
... and it is when it is running like this with embedded perl off which
results in "(null)" (instead of "(Service check did not exit
properly)")on my services summary page as well a critical state.
I look forward to hearing what suggestions you might have
Ryan
------------------------------------------------------------------------------
ThinkGeek and WIRED's GeekDad team up for the Ultimate
GeekDad Father's Day Giveaway. ONE MASSIVE PRIZE to the
lucky parental unit. See the prize list and enter to win:
http://p.sf.net/sfu/thinkgeek-promo
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list