Yet another NRPE issue...

andrew.slater at netstrategy.net
Tue Dec 3 07:06:34 CET 2002


Hi all,

The story goes like this:

I had my Nagios setup working beautifully, with NRPE running fine on 20-odd
hosts. I went away for a couple of weeks and, as the system is somewhat
developmental at the moment, was happy enough to have it shut down while I
was away. The problem is that now I am back and the system is plugged back
in, I can't get any response from any server with NRPE on it.

The details:

- check_nrpe run on the target machine works fine
- Nagios called check_nrpe command fails with "CHECK_NRPE: Received 0
bytes. Are we allowed to connect to the host?"
- telnet to port 5666 on target machine from Nagios machine fails


- telnet to localhost:5666 on target machine fails:
[root@cache01 xinetd.d]# telnet
telnet> open xxx.xxx.xxx.xxx 5666
Trying xxx.xxx.xxx.xxx...
Connected to xxx.xxx.xxx.xxx.
Escape character is '^]'.
Connection closed by foreign host. (This happens instantaneously.)


- xinetd _is_ listening on port 5666 on the target machines:
[root@cache01 xinetd.d]# netstat -pan | grep
tcp        0      0 0.0.0.0:5666            0.0.0.0:*               LISTEN      5320/xinetd


- strace output of the command line "./check_nrpe xxx.xxx.xxx.xxx -p 5666 -c check_users" run against a machine on the same subnet:
...
...
send(3, "\0\0\0\1\0\0\0\1\0\0\0\0\0\0\0\vcheck_users\0\0\0\0\0"..., 1040, 0) = 1040
time([1038894731])                      = 1038894731
recv(3, 0xbffff2e0, 1040, 0)            = -1 ECONNRESET (Connection reset by peer)
fstat64(1, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 1), ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x40014000
write(1, "CHECK_NRPE: Error receiving data"..., 44CHECK_NRPE: Error receiving data from host.
) = 44
close(3)                                = 0
alarm(0)                                = 10
munmap(0x40014000, 4096)                = 0
_exit(3)                                = ?



- strace output of the command line "./check_nrpe xxx.xxx.xxx.xxx -p 5666 -c check_users" run against a machine on a different subnet:
...
...
send(3, "\0\0\0\1\0\0\0\1\0\0\0\0\0\0\0\vcheck_users\0\0\0\0\0"..., 1040, 0) = 1040
time([1038894876])                      = 1038894876
recv(3, "", 1040, 0)                    = 0
fstat64(1, {st_mode=S_IFCHR|0620, st_rdev=makedev(136, 1), ...}) = 0
mmap2(NULL, 4096, PROT_READ|PROT_WRITE, MAP_PRIVATE|MAP_ANONYMOUS, -1, 0) = 0x40014000
write(1, "CHECK_NRPE: Received 0 bytes.  A"..., 70CHECK_NRPE: Received 0 bytes.  Are we allowed to connect to the host?
) = 70
close(3)                                = 0
alarm(0)                                = 10
munmap(0x40014000, 4096)                = 0
_exit(3)                                = ?
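
The telnet check above can also be scripted so that it is easy to repeat
against each of the hosts. What follows is only a minimal sketch, assuming
Python 3 is available on the Nagios machine; `probe_tcp` and its result
labels are hypothetical names made up for illustration and are not part of
NRPE or Nagios. It connects, sends nothing, and reports how the peer reacts.
The 1040-byte read size is simply the packet size visible in the strace
output.

```python
import socket
import sys


def probe_tcp(host, port, timeout=5.0):
    """Connect to host:port without sending data and classify the reaction.

    The returned labels are hypothetical, chosen only for this sketch.
    """
    try:
        sock = socket.create_connection((host, port), timeout=timeout)
    except ConnectionRefusedError:
        return "refused"          # nothing is listening on that port
    except socket.timeout:
        return "connect-timeout"  # no answer at all to the connection attempt

    with sock:
        try:
            data = sock.recv(1040)
        except ConnectionResetError:
            return "reset"        # peer answered with a TCP reset
        except socket.timeout:
            return "open"         # connection stayed up, peer awaits input

    # An empty read means the peer closed the connection cleanly.
    return "closed" if data == b"" else "data"


if __name__ == "__main__":
    # Usage: python3 probe_nrpe.py <host> <port>
    print(probe_tcp(sys.argv[1], int(sys.argv[2])))
```

Saved under a file name such as probe_nrpe.py (also hypothetical), it could
be run as "python3 probe_nrpe.py xxx.xxx.xxx.xxx 5666". A result of "closed"
would correspond to the instant disconnect seen with telnet and to the
zero-byte recv in the second trace. Because the sketch sends no request, a
host that resets only after receiving data may also show up as "closed"
rather than "reset".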


I have to admit that I'm still pretty much in a holiday mood, so there could
be something ridiculously simple I am missing (here's hoping), but I am
getting to the end of my useful troubleshooting ideas, so any light from
any list members would be greatly appreciated.

Thanks to all,

Andrew Slater



