check_disk and stale NFS
VALIYEV Ruslan
valiyev at unicc.org
Mon Nov 5 16:56:15 CET 2012
Hi all,
I would like to monitor all local and non-local filesystems on a server + check for stale NFS mount points. And according to man, the option I need is the following:
-L, --stat-remote-fs
Only check local filesystems against thresholds. Yet call stat on remote filesystems
to test if they are accessible (e.g. to detect Stale NFS Handles)
However, it looks like check_disk is simply hanging when there's a stale NFS.
[root at server ~]# /usr/lib64/nagios/plugins/check_disk -L -t 5
...hang state...
Here's /etc/mtab
[root at server ~]# grep nfs /etc/mtab
sunrpc /var/lib/nfs/rpc_pipefs rpc_pipefs rw 0 0
10.111.11.10:/icc_hds_transfer /icc_hds_transfer nfs rw,bg,addr=10.111.11.10 0 0
10.111.11.10:/ora_disk_backup /ora_disk_backup nfs rw,bg,hard,nointr,rsize=32768,wsize=32768,tcp,nfsvers=3,timeo=600,actimeo=0,addr=10.111.11.10 0 0
10.111.11.10:/icc_fast_refresh /icc_fast_refresh nfs rw,bg,addr=10.111.11.10 0 0
And here's df in hang state
[root at server ~]# df -P
Filesystem 1024-blocks Used Available Capacity Mounted on
/dev/sda2 11904620 3857608 7432528 35% /
/dev/mapper/Volsda1-LogVol02 7935392 4927660 2598132 66% /home
/dev/mapper/Volsda1-LogVol01 7935392 439296 7086496 6% /opt
/dev/sda5 7936256 151484 7375120 3% /tmp
/dev/sda3 7936288 927140 6599492 13% /var
/dev/sda1 350018 16903 315044 6% /boot
tmpfs 16474872 0 16474872 0% /dev/shm
/dev/sdb1 287826944 187056292 86149900 69% /u01
/dev/sdc1 287826944 172147168 101059024 64% /oratemp
...hang state...
[root at server ~]# grep nfs /var/log/messages
Nov 5 16:40:05 server kernel: nfs: server 10.111.11.10 not responding, still trying
What am I doing wrong? I just tried the latest nagios-plugins snapshot and the same thing happened.
Best regards,
Ruslan Valiyev
.
..:
Linux System Administrator
Systems Support Unit (OTAS)
Operations and Technology Division
International Computing Centre (ICC)
Email: valiyev at unicc.org
Website: www.unicc.org
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20121105/90d51255/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
LogMeIn Central: Instant, anywhere, Remote PC access and management.
Stay in control, update software, and manage PCs from one command center
Diagnose problems and improve visibility into emerging IT issues
Automate, monitor and manage. Do more in less time with Central
http://p.sf.net/sfu/logmein12331_d2d
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list