Elasticsearch alert AggregatedLoggingSystemCPUHigh in OCP 4
Environment
- Red Hat OpenShift Container Platform (RHOCP)
- 4
- Red Hat OpenShift Logging
- 5
Issue
- Elasticsearch alert
AggregatedLoggingSystemCPUHighis triggered even when settinglimits.cpu, but not all cpu used - Able to see in alertmanager the alert for Elasticsearch
AggregatedLoggingSystemCPUHigh, even when reviewing the metrics, not all cpu is used.
Resolution
The requests.cpu value should be adjusted to be a value close to the real CPU usage of the Elasticsearch pods.
The monitoring stack or the Elasticsearch Dashboard can be used to check the Elasticsearch pods CPU usage.
Root Cause
This issue is in relation with the explained in the article: " CPU Throttling even when the container does not reach its CPU Limit ".
Diagnostic Steps
-
Check that one alert
AggregatedLoggingSystemCPUHighis triggered for Elasticsearch -
Verify that the
log storecomponent from the cluster logging stack was set withlimits.cpu:$ oc -n openshift-logging get clusterlogging -o jsonpath="{.items[*].spec.logStore.elasticsearch.resources.limits.memory}" 16Gi -
Verify in Prometheus or OCP Console that the Elasticsearch pods are not using all CPU.
This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.