Distributed nagios problem - service definition not found!
Jan Scholten
Jan.Scholten at iconz.net
Wed Oct 6 22:51:05 CEST 2004
Can you supply the relevant part of services.cfg?
It seems you have a misconfiguration. Are you sure the service is Check
Host Alive and not PING (like default)?
I don't know whether Nagios likes a servie_name with a blank, so try it
without!
So the return value ServiceName("Check Host Alive" in your case) must be
the same as your service_description in the services.cfg for that host.
Jan
> I just recently setup distributed nagios. I followed the directions
> very closely. I first had a problem running the nsca daemon through
> xinetd. It just wouldn't listen for incoming on 5667. I added the line
> to /etc/services also. Here is my config:
> service nsca
> {
> flags = REUSE
> socket_type = stream
> wait = no
> user = nagios
> group = nagios
> server = /usr/local/nagios/bin/nsca
> server_args = -c /usr/local/nagios/etc/nsca.cfg
> log_on_failure += USERID
> disable = no
> only_from = 10.12.225.50
> }
>
> If I run it from command line in daemon mode it works fine.
> My main problem, is that when passive checks are sent to the central
> server I keep getting this error:
> Oct 6 15:02:28 noc-mon nsca[31620]: Connection from 10.12.225.50 port
> 38784
> Oct 6 15:02:28 noc-mon nsca[31620]: Host address checks out ok
> Oct 6 15:02:28 noc-mon nsca[31620]: Handling the connection...
> Oct 6 15:02:29 noc-mon nsca[31620]: SERVICE CHECK -> Host Name:
> 'localhost', Service Description: 'cpu', Return Code: '0', Output: 'OK -
> load average: 0.00, 0.00, 0.00'
> Oct 6 15:02:29 noc-mon nsca[31620]: End of connection...
> Oct 6 15:02:30 noc-mon nagios: EXTERNAL COMMAND:
> PROCESS_SERVICE_CHECK_RESULT;localhost;cpu;0;OK - load average: 0.00,
> 0.00, 0.00
> Oct 6 15:02:39 noc-mon nsca[31817]: Connection from 10.12.225.50 port
> 39040
> Oct 6 15:02:39 noc-mon nsca[31817]: Host address checks out ok
> Oct 6 15:02:39 noc-mon nsca[31817]: Handling the connection...
> Oct 6 15:02:40 noc-mon nsca[31817]: SERVICE CHECK -> Host Name:
> 'acdmz-inside-sw2', Service Description: 'Check Host Alive', Return
> Code: '0', Output: 'PING OK - Packet loss = 0%, RTA = 0.83 ms'
> Oct 6 15:02:40 noc-mon nsca[31817]: End of connection...
> Oct 6 15:02:40 noc-mon nagios: EXTERNAL COMMAND:
> PROCESS_SERVICE_CHECK_RESULT;acdmz-inside-sw2;Check Host Alive;0;PING OK
> - Packet loss = 0%, RTA = 0.83 ms
> Oct 6 15:02:44 noc-mon nagios: Warning: Message queue contained
> results for service 'Check Host Alive' on host 'acdmz-inside-sw2'. The
> service could not be found!
>
> The localhost check acts like it works, but the simple check-host-alive
> service definition is not. I know that that service definition is on
> both servers. They are both running v1.2. Also, should I see something
> on my central server's web gui showing these hosts down? My host count
> has not been affected at all by the hosts added to the distributed
> server. Am I missing something? Is there something wrong with the
> default check-host-alive service check?
> Thanks for any help,
> --------------------------------------
> Tim Moore
> DNS/Linux/Cisco Admin
> ODJFS
>
--
Jan Scholten
Research and Development Intern
Iconz.co.nz
-------------------------------------------------------
This SF.net email is sponsored by: IT Product Guide on ITManagersJournal
Use IT products in your business? Tell us what you think of them. Give us
Your Opinions, Get Free ThinkGeek Gift Certificates! Click to find out more
http://productguide.itmanagersjournal.com/guidepromo.tmpl
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list