solutions for off-server PNP4Nagios perfdata processing?
Frost, Mark {BIS}
mark.frost1 at pepsico.com
Wed Oct 3 17:58:20 CEST 2012
Dan,
As I understand it, the issue is less about Nagios and more about npcd. Nagios merrily produces the perfdata files and then npcd comes along and scoops them up, but as it's processing them it's opening a lot of rrd files and inserting data into them. So really it's npcd that's the problem. Well, not really a problem, but ultimately it's doing its thing and then Nagios gets less than a fair share of the box's I/O. It's not that it's horrible right now, but we're starting to notice it and I would tend to be concerned about scaling problems.
Honestly even with Nagios 3, it seems like Nagios' own I/O is entirely manageable so far with strategic use of ram disk. It's just putting Nagios and PNP4Nagios (plus Apache to serve up the graph contents which I'm also not happen going on on the same server) on the same boxes that I don't like.
Hmm. I was unaware that rrdcached could be configured to receive data over the network. I'm assuming that means that npcd can be configured to send. I'll check that out. Still doesn't feel like an elegant solution, but it may fit the bill.
Thanks
Mark
From: Daniel Wittenberg [mailto:daniel.wittenberg.r0ko at statefarm.com]
Sent: Wednesday, October 03, 2012 11:08 AM
To: Nagios Users List
Subject: Re: [Nagios-users] solutions for off-server PNP4Nagios perfdata processing?
You might consider looking at 4.0 since disk i/o is almost nothing, but short of that looked at using rrdcache to send the processing to another server?
Dan
On Oct 3, 2012, at 9:33 AM, Frost, Mark {BIS} wrote:
Hello. Has anyone come up with solutions for processing Nagios performance data on a server other than a Nagios server? We've been processing perfdata results on our Nagios server(s) for a while now and increasingly it's just eating up too much I/O to make me comfortable.
Yes, we do use rrdcached and yes, I realize that shuffling data around on different disk spindles and controllers would help, but in today's world where companies don't like building any kind of physical server let alone one with all that additional hardware, that's not entirely an option for us.
I realize that once the perfdata files are on the dedicated graphing server(s), processing them into RRD files there should be a no-brainer. My problem is figuring out how to get them there without say, using a NAS device. (If I/O's a problem locally, I don't want to shuffle that I/O to an even slower network device).
It would be ideal if somehow there was a process that I could just send that data to and have it picked up remotely. Like if maybe Merlin have a special kind of peer that just received a stream of perfdata or something. Anything else I could imagine would be some kind of home-grown solution like say pumping events into a messaging system from the Nagios server(s) and then letting the graphing server pick them up from the message queue(s). I could also imagine some kind of fancy-pants module in Nagios 4 that did something like this, maybe.
Any thoughts would be appreciated.
Thanks
Mark
------------------------------------------------------------------------------
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net<mailto:Nagios-users at lists.sourceforge.net>
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/users/attachments/20121003/dfd05ad1/attachment.html>
-------------- next part --------------
------------------------------------------------------------------------------
Don't let slow site performance ruin your business. Deploy New Relic APM
Deploy New Relic app performance management and know exactly
what is happening inside your Ruby, Python, PHP, Java, and .NET app
Try New Relic at no cost today and get our sweet Data Nerd shirt too!
http://p.sf.net/sfu/newrelic-dev2dev
-------------- next part --------------
_______________________________________________
Nagios-users mailing list
Nagios-users at lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nagios-users
::: Please include Nagios version, plugin version (-v) and OS when reporting any issue.
::: Messages without supporting info will risk being sent to /dev/null
More information about the Users
mailing list