check_procs problems
Voon, Ton
Ton.Voon at egg.com
Tue May 20 15:16:14 CEST 2003
You need to use check_nrpe from the central Nagios server to your remote
host. Your check_command is running check_procs on the central Nagios
server, not on the remote host. You need on the Nagios server something
like:
define service{
use generic-service
host_name host
service_description MEMORY
check_command check_nrpe!check_swap
}
define command{
command_name check_nrpe
command_line $USER1$/check_nrpe -H $HOSTADDRESS$ -to 30 -c $ARG1$
}
And, on the remote host, nrpe.cfg something like:
command[check_swap]=/usr/local/nagios/libexec/check_swap -w 100000 -c 50000
Ton
> -----Original Message-----
> From: Webster, Stuart SITI-ITDIUR [SMTP:Stuart.Webster at shell.com]
> Sent: Tuesday, May 20, 2003 1:53 PM
> To: Voon, Ton; nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] check_procs problems
>
> Yeah i knew that (if i understand what your saying ;-) ), nrpe is running
> on my remote host,, and its checking /usr/sbin/inetd just fine.
>
> OK - 1 processes running with command name /usr/sbin/inetd
> but I still get
> CRITICAL - 0 processes running with command name /opt/CTXSmf/slib/ctxXtw
>
> even though multiple instances are running.
>
> -----Original Message-----
> From: Voon, Ton [mailto:Ton.Voon at egg.com]
> Sent: 20 May 2003 14:44
> To: Webster, Stuart SITI-ITDIUR; nagios-users at lists.sourceforge.net
> Subject: RE: [Nagios-users] check_procs problems
>
>
> Ah-ha, making sense now. Do you realise plugins are run locally? You need
> nrpe or check_by_ssh to run a command on a remote server.
>
> > -----Original Message-----
> > From: Webster, Stuart SITI-ITDIUR [SMTP:Stuart.Webster at shell.com]
> > Sent: Tuesday, May 20, 2003 1:39 PM
> > To: Voon, Ton; nagios-users at lists.sourceforge.net
> > Subject: RE: [Nagios-users] check_procs problems
> >
> > Im baffled now
> > ./check_procs -C /opt/CTXSmf/slib/ctxXtw
> > OK - 17 processes running with command name /opt/CTXSmf/slib/ctxXtw
> >
> > that works when run on rempte hosts command line,, so there MUST be
> > something with my config on the nagios server. but i cant spot it!!!!!!!
> >
> > # 'check_procs' command definition
> > define command{
> > command_name check_procs
> > command_line $USER1$/check_procs -c $ARG1$ -C $ARG2$
> > }
> >
> >
> > # Service definition
> > define service{
> > use generic-service ; Name
> of
> > service template to use
> > host_name rtssip33
> > service_description Check Citrix Daemon
> > is_volatile 0
> > check_period 24x7
> > max_check_attempts 3
> > normal_check_interval 15
> > retry_check_interval 1
> > contact_groups nagios-admins
> > notification_interval 120
> > notification_period 24x7
> > notification_options w,u,c,r
> > check_command
> > check_procs!1:1!/opt/CTXSmf/slib/ctxXtw
> > }
> >
> > -----Original Message-----
> > From: Voon, Ton [mailto:Ton.Voon at egg.com]
> > Sent: 20 May 2003 14:24
> > To: Webster, Stuart SITI-ITDIUR; nagios-users at lists.sourceforge.net
> > Subject: RE: [Nagios-users] check_procs problems
> >
> >
> > Everything looks right, so I'm baffled. Couple of options:
> >
> > 1) check_procs -C /opt/CTXSmf/slib/ctxXtw should work
> > 2) newer r1_3_0 branch at http://nagiosplug.sf.net/snapshot. Was an
> > enhancement to -a argument, as r1.3.0 only matches from beginning of
> > argument
> > 3) use HEAD snapshot. check_procs -v -v -v will give maximum debug
> output
> > 4) add debug statements in check_procs.c to work out why it is being
> > rejected
> >
> > I bet it is something really obvious...
> >
> > Ton
> >
> > > -----Original Message-----
> > > From: Webster, Stuart SITI-ITDIUR [SMTP:Stuart.Webster at shell.com]
> > > Sent: Tuesday, May 20, 2003 12:34 PM
> > > To: Voon, Ton; nagios-users at lists.sourceforge.net
> > > Subject: RE: [Nagios-users] check_procs problems
> > >
> > >
> > >
> > > Here is a small portion of the output, looks to me like i should be
> > > looking for thesame string as with ps -ef
> > >
> > > ps -Ao 's uid ppid comm args' | grep ctxXtw
> > > S 1910 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 1022 -terminate -notransfills
> > > -frameexpose -no
> > > S 1910 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 207 -terminate -notransfills
> > -frameexpose
> > > -not
> > > S 1949 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 1034 -terminate -notransfills
> > > -frameexpose -no
> > > S 4219 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 1044 -terminate -notransfills
> > > -frameexpose -no
> > > S 1949 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 772 -terminate -notransfills
> > -frameexpose
> > > -not
> > > S 19711 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 1050 -terminate -notransfills
> > > -frameexpose -no
> > > S 5316 335 /opt/CTXSmf/slib/ctxXtw
> > > /opt/CTXSmf/slib/ctxXtw -session 1084 -terminate -notransfills
> > > -frameexpose -no
> > >
> > > /opt/CTXSmf/slib/ctxXtw -session
> > > S 19711 21016 grep
> > > grep ctxXtw
> > >
> > > -----Original Message-----
> > > From: Voon, Ton [mailto:Ton.Voon at egg.com]
> > > Sent: 20 May 2003 13:30
> > > To: Webster, Stuart SITI-ITDIUR; Voon, Ton;
> > > nagios-users at lists.sourceforge.net
> > > Subject: RE: [Nagios-users] check_procs problems
> > >
> > >
> > > Try on the command line first, to remove the definitions out of the
> > loop.
> > > I
> > > get on SunOS 5.6:
> > >
> > > $ ps -Ao 's uid ppid comm args' | grep ttymon
> > > S 0 1 /usr/lib/saf/ttymon
> > > /usr/lib/saf/ttymon -g -h -p snail console login: -T sun -d
> > /dev/console
> > > -l
> > > co
> > > S 0 404 /usr/lib/saf/ttymon
> > > /usr/lib/saf/ttymon
> > > $ ./check_procs -c 1:20 -a "/usr/lib/saf/ttymon -g "
> > > OK - 1 processes running with args /usr/lib/saf/ttymon -g
> > >
> > > What do you get with:
> > >
> > > ps -Ao 's uid ppid comm args' | grep ctxXtw
> > >
> > > ?
> > >
> > > Ton
> > >
> > > > -----Original Message-----
> > > > From: Webster, Stuart SITI-ITDIUR [SMTP:Stuart.Webster at shell.com]
> > > > Sent: Tuesday, May 20, 2003 12:02 PM
> > > > To: Voon, Ton; nagios-users at lists.sourceforge.net
> > > > Subject: RE: [Nagios-users] check_procs problems
> > > >
> > > > Thanks for reply Ton,,
> > > >
> > > > still no luck though
> > > >
> > > > here is my config
> > > >
> > > > # Service definition
> > > > define service{
> > > > use generic-service ;
> Name
> > > of
> > > > service template to use
> > > > host_name citric-server
> > > > service_description Check Citrix Daemon
> > > > is_volatile 0
> > > > check_period 24x7
> > > > max_check_attempts 3
> > > > normal_check_interval 15
> > > > retry_check_interval 1
> > > > contact_groups nagios-admins
> > > > notification_interval 120
> > > > notification_period 24x7
> > > > notification_options w,u,c,r
> > > > check_command
> > > > check_procs_array!1:100!"/opt/CTXSmf/slib/ctxXtw -session"
> > > > }
> > > >
> > > > # 'check_procs_array' command definition
> > > > define command{
> > > > command_name check_procs_array
> > > > command_line $USER1$/check_procs -c $ARG1$ -a $ARG2$
> > > > }
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > CRITICAL - 0 processes running with args /opt/CTXSmf/slib/ctxXtw
> > > -session
> > > >
> > > > BUT these services are running, lots of them in fact
> > > >
> > > > # ./check_procs -v
> > > > /usr/bin/ps -Ao 's uid ppid comm args'
> > > > OK - 68 processes running
> > > >
> > > > My version of ps is the standard solaris 8 version
> > > >
> > > >
> > > > Cheers, help appreciated
> > > >
> > > > Stuart
> > > >
> > > >
> > > >
> > > >
> > > >
> > > > -----Original Message-----
> > > > From: Voon, Ton [mailto:Ton.Voon at egg.com]
> > > > Sent: 20 May 2003 12:49
> > > > To: Webster, Stuart SITI-ITDIUR; nagios-users at lists.sourceforge.net
> > > > Subject: RE: [Nagios-users] check_procs problems
> > > >
> > > >
> > > > Which version of the plugins?
> > > >
> > > > If you run check_procs -v, it will show the ps command used. There
> > maybe
> > > > differences between the command used and ps -ef that you have used
> for
> > > the
> > > > 2nd case below.
> > > >
> > > > For the 3rd case, -a matches substrings, so just use the start of
> the
> > > > command "/opt/CTXSmf/slib/ctxXtw -session".
> > > >
> > > > Ton
> > > >
> > > > > -----Original Message-----
> > > > > From: Webster, Stuart SITI-ITDIUR [SMTP:Stuart.Webster at shell.com]
> > > > > Sent: Tuesday, May 20, 2003 10:03 AM
> > > > > To: nagios-users at lists.sourceforge.net
> > > > > Subject: [Nagios-users] check_procs problems
> > > > >
> > > > > Hello
> > > > >
> > > > > I am trying to monitor 3 Processes with nagios.
> > > > >
> > > > > /usr/sbin/inetd
> > > > > pbs_server
> > > > >
> > > > > /opt/CTXSmf/slib/ctxXtw -session 835 -terminate -notransfills
> > > > -frameexpose
> > > > > -not
> > > > >
> > > > >
> > > > > I have tried the following.
> > > > >
> > > > >
> > > > > check_procs -c 1:1 -C /usr/sbin/inetd this works
> > fine
> > > > "OK
> > > > > - 1 processes running with command name /usr/sbin/inetd"
> > > > >
> > > > >
> > > > > check_procs -c 1:1 -C pbs_server this reports "CRITICAL
> -
> > 0
> > > > > processes running with command name pbs_server"
> > > > > but ps -ef|grep pbs_server reports "root 19925 1 0
> > > May
> > > > 07
> > > > > ? 0:25 pbs_server"
> > > > > so the service is up, but check_procs is not seeing it????? any
> > > ideas?
> > > >
> > > > >
> > > > > also the third process i dont know how to check as the number in
> the
> > > > > command line changes with every instance of the process, so
> > > > >
> > > > > check_procs -c 1:20 -a "/opt/CTXSmf/slib/ctxXtw -session 835
> > > -terminate
> > > > > -notransfills -frameexpose -not" this will not work as
> > only
> > > 1
> > > > > process has that number.
> > > > >
> > > > > Any help from much appreciated.
> > > > >
> > > > >
> > > > > Stuart
> > >
> >
> >
> > This private and confidential e-mail has been sent to you by Egg.
> > The Egg group of companies includes Egg Banking plc
> > (registered no. 2999842), Egg Financial Products Ltd (registered
> > no. 3319027) and Egg Investments Ltd (registered no. 3403963) which
> > carries out investment business on behalf of Egg and is regulated
> > by the Financial Services Authority.
> > Registered in England and Wales. Registered offices: 1 Waterhouse
> Square,
> > 138-142 Holborn, London EC1N 2NA.
> > If you are not the intended recipient of this e-mail and have
> > received it in error, please notify the sender by replying with
> > 'received in error' as the subject and then delete it from your
> > mailbox.
-------------------------------------------------------
This SF.net email is sponsored by: ObjectStore.
If flattening out C++ or Java code to make your application fit in a
relational database is painful, don't do it! Check out ObjectStore.
Now part of Progress Software. http://www.objectstore.net/sourceforge
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list