High CPU consumption by java and redis-server
Rahul Amaram
rahul.amaram at vizury.com
Wed Sep 17 21:18:44 CEST 2014
/When it comes to your last finding I have no explanation. Just to
understand you compare using -24H with -10080M (-168H). Would it not be
better to compare -24H and -1440M. I have to get back to you on this but
I would need to get the result when running in cacheCli since you get
the time it takes,
http://www.bischeck.org/wp-content/uploads/2014/06/Bischeck_installation_and_administration_guide.html#toc-Section-4.4.//
///
This was a typo. I was talking about -168H and -10080M. Also, I used
"bischeck cli.CacheCli" to check this. And I re-ran this now, but not
finding much difference between both of them (it takes about 4-6 seconds
to retrieve the value).
Reg. other points, I have to get back to you. On a side note, I have
upgraded from redis-server 2.6 to 2.8, just to rule out any version
performance issues.
Thanks,
Rahul.
On Thursday 18 September 2014 12:19 AM, Anders Håål wrote:
> Hi Rahul,
> Looking at your threshold this means that you will retrieve max 6
> values, which should not be that "hard" even if its a time based query
> - using index is faster and is something we will look into in the future.
> Since you run the query every 120 sec it means that you currently have
> at lest 5040 items in the cache for this each service, which does not
> sound to bad. 10 services at least 50000 in total.
> What I like you to check is the following:
> - If you connect with some JMX client against bischeck you can see all
> the different timers
> http://www.bischeck.org/wp-content/uploads/2014/06/Bischeck_installation_and_administration_guide.html#toc-Chapter-5.
> The once that are related to threshold are inserting to start with but
> check all the different timers if some one have long execution time.
> - Since the its the redis-server that are consume a high level of CPU
> its interesting to see the configuration for redis - like the amount
> of memory allocated. If redis need to swap its not good.
> - Please check the redis log files.
> - You can also connect to redis with redis-cli and run command
> "monitor" to get a real time listing on the commands executed against
> redis.
> - Also check with top the percentage of %wa, waiting for io. How much
> memory do you have on the server? Only running bischeck and redis?
> - How much cpu is bischeck consuming? Do you see any peaks?
> - Also check the bischeck log to see any ERROR or WARN.
> - And finally - has this been the behavior from the beginning or has
> it increased over time? What happen if you restart bischeck (not reload)?
>
> Try to collect some more info so we can try to determine where the
> issue is related.
>
> When it comes to your last finding I have no explanation. Just to
> understand you compare using -24H with -10080M (-168H). Would it not
> be better to compare -24H and -1440M. I have to get back to you on
> this but I would need to get the result when running in cacheCli since
> you get the time it takes,
> http://www.bischeck.org/wp-content/uploads/2014/06/Bischeck_installation_and_administration_guide.html#toc-Section-4.4.
>
>
> Regards
> Anders
>
>
>
>
>
> On 09/17/2014 07:13 PM, Rahul Amaram wrote:
>> Hi,
>> I am observing very high CPU consumption by the java process and
>> redis-server. redis-server being single threaded it self is taking
>> 100% CPU. I have about 10 hosts, with about 10 services each (with
>> one service item per service). The time interval for generation of
>> value is 120s. The threshold that I have defined is:
>>
>> avg($$HOSTNAME$$-$$SERVICENAME$$-$$SERVICEITEMNAME$$[-24H],$$HOSTNAME$$-$$SERVICENAME$$-$$SERVICEITEMNAME$$[-96H],$$HOSTNAME$$-$$SERVICENAME$$-$$SERVICEITEMNAME$$[-168H],$$HOSTNAME$$-$$SERVICENAME$$-$$SERVICEITEMNAME$$[-336H],$$HOSTNAME$$-$$SERVICENAME$$-$$SERVICEITEMNAME$$[-504H],$$HOSTNAME$$-$$SERVICENAME$$-$$SERVICEITEMNAME$$[-672H])
>>
>>
>> However, currently, not more than 3 values, are available.
>>
>> I am already running this on a c3.xlarge machine (4 cores) and the
>> load average is quite often > 4 resulting in delay of generation of
>> values. Any pointers in what could be causing the high load would be
>> much appreciated.
>>
>> On a slightly different note, while using cli.CacheCli, retrieving
>> the value of a service item one week back using hours (-24H) is
>> considerably faster than retrieving it using minutes (-10080M).
>> Again, why does bischeck behave this way?
>>
>> Thanks,
>> Rahul.
>>
>
>
--
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://www.monitoring-lists.org/archive/bischeck-users/attachments/20140918/76b42255/attachment.html>
More information about the Bischeck-users
mailing list