Multipme nagios process and lost "nagios.cmd"
Gili Lapid
GiliL at sodaclub.co.il
Wed Dec 29 08:20:05 CET 2004
Hi All
If I do "ps -efw | grep nadios" I can see one or more nagios process.
<snip>
[root at nagios etc]# ps -efw | grep "nagios"
nagios 11458 1 0 16:54 ? 00:00:01 /usr/local/nagios/bin/nagios
-d /usr/local/nagios/etc/nagios.cfg
nagios 20148 1 0 18:03 ? 00:00:00 /usr/local/nagios/bin/nagios
-d /usr/local/nagios/etc/nagios.cfg
nagios 20149 20148 0 18:03 ? 00:00:00
/usr/local/nagios/libexec/check_ping -H 10.1.1.1 -w 100.0,20% -c 500.0,60%
-p
nagios 20150 20149 0 18:03 ? 00:00:00 /bin/ping -n -U -c 5
10.1.1.1
<snap>
They all have 1 in the PPID (The parent ID). some time I can see only one,
but when there are more then one I can see the plugins are running too...
Also I do not have the nagios.cmd in the var/rw folder. I read the manual in
this link and did as I told (restart the apache & nagios :-) at the end...),
and nothing...
http://nagios.sourceforge.net/docs/1_0/commandfile.html
<http://nagios.sourceforge.net/docs/1_0/commandfile.html>
Also, if a plugin have a "time out" status the "Current Attempt" stay at 1
(if let say I have 1 out of 5) and not sending allerts...
The system is up and running for a month or so whiteout any problems,
suddenly this started...
<snip>
[root at nagios etc]# ll /usr/local/nagios/var/
total 23572
drwxrwxr-x 2 nagios apache 4096 Dec 28 00:00 archives
-rw-rw-r-- 1 nagios nagios 0 Dec 28 16:54 comment.log
-rw-rw-r-- 1 nagios nagios 0 Dec 28 16:54 downtime.log
-rw-rw-r-- 1 nagios apache 417288 Dec 12 14:41 hostperf.log
-rw-r--r-- 1 root root 6 Dec 28 16:54 nagios.lock
-rw-r--r-- 1 nagios nagios 130742 Dec 28 17:54 nagios.log
-rw-r--r-- 1 root apache 122 Nov 29 17:20
perfparse.log.20041129.log
-rw-r--r-- 1 root root 248 Dec 12 16:58
perfparse.log.20041212.log
drwxrws--- 2 nagios nagiocmd 4096 Dec 28 16:06 rw
drwxr-xr-x 5 nagios apache 4096 Dec 27 16:10 sat
-rw-rw-r-- 1 nagios apache 23478843 Dec 12 14:44 serviceperf.log
-rw-rw-r-- 1 nagios nagios 23583 Dec 28 17:57 status.log
-rw-rw-r-- 1 nagios nagios 17437 Dec 28 17:54 status.sav
[root at nagios etc]# ll /usr/local/nagios/var/rw/
total 0
[root at nagios etc]# cat /etc/group | grep nagiocmd
nagiocmd:x:501:nagios,nobody,apache
[root at nagios etc]# df -h
Filesystem Size Used Avail Use% Mounted on
/dev/hdb1 3.8G 1.6G 2.0G 44% /
none 61M 0 61M 0% /dev/shm
/dev/hdb6 193M 5.0M 178M 3% /tmp
/dev/hdb5 6.7G 4.0G 2.3G 63% /usr
/dev/hdb3 1.9G 868M 1002M 47% /var
<snap>
TIA,
Gili
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20041229/fa5a3a02/attachment.html>
More information about the Users
mailing list