A question...cascading failures and failure to recover

Josh Yost Josh.Yost at epsiia.com
Sat Feb 17 00:22:03 CET 2007


Steven Schwartz wrote:

> 1) On a given server, a plugin produces a "critical failure" on many
> (sometimes all) of the systems using that particular plugin.
> 
> 2) Tests by hand of said plugin produce an "OK" result.
> 
> 3) The system does not acknowledge the service having recovered until
> checks are rescheduled by force, and then execute OK.
> 

Hi,
	You'd need to say which plugin it is and what it uses to connect to
the remote hosts (SSH, SNMP, etc.).  You may have some underlying
connectivity issues on your network.
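
One way to check for connectivity trouble is to probe the affected hosts
repeatedly from the Nagios server and count the failures. Below is a minimal
sketch, assuming the plugin reaches the hosts over a TCP service such as SSH
on port 22 (SNMP normally runs over UDP, which this does not cover). The host
names, the function names probe_tcp and count_failures, and the attempt count
are made up for illustration; none of this is part of Nagios or its plugins.

```python
import socket


def probe_tcp(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        # create_connection resolves the name and connects; any resolution,
        # refusal, or timeout error surfaces as OSError (or a subclass).
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def count_failures(hosts, port, attempts=5, timeout=3.0):
    """Probe each host several times; return {host: failed_attempt_count}."""
    return {
        host: sum(not probe_tcp(host, port, timeout) for _ in range(attempts))
        for host in hosts
    }


if __name__ == "__main__":
    # Hypothetical host names: replace with the hosts the failing plugin checks.
    hosts = ["host1.example.com", "host2.example.com"]
    attempts = 5
    for host, failed in count_failures(hosts, port=22, attempts=attempts).items():
        print(f"{host}: {failed} of {attempts} attempts failed")
```

If some hosts fail only now and then, that points toward the network rather
than the plugin. If every attempt succeeds, the problem is more likely in how
the check is being run than in connectivity.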

- Josh

_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users