KubernetesHpaUnderutilized #

HPA {{ $labels.namespace }}/{{ $labels.horizontalpodautoscaler }} is constantly at minimum replicas for 50% of the time. Potential cost saving here.

Alert Rule

alert: KubernetesHpaUnderutilized
annotations:
  description: |-
    HPA {{ $labels.namespace }}/{{ $labels.horizontalpodautoscaler }} is constantly at minimum replicas for 50% of the time. Potential cost saving here.
      VALUE = {{ $value }}
      LABELS = {{ $labels }}    
  runbook: https://srerun.github.io/prometheus-alerts/runbooks/kubestate-exporter/kuberneteshpaunderutilized/
  summary: Kubernetes HPA underutilized (instance {{ $labels.instance }})
expr: max(quantile_over_time(0.5, kube_horizontalpodautoscaler_status_desired_replicas[1d])
  == kube_horizontalpodautoscaler_spec_min_replicas) by (horizontalpodautoscaler)
  &gt; 3
for: 0m
labels:
  severity: info

Here is a sample runbook for the KubernetesHpaUnderutilized alert:

Meaning #

The KubernetesHpaUnderutilized alert is triggered when a Horizontal Pod Autoscaler (HPA) is constantly at minimum replicas for 50% of the time, indicating potential underutilization of resources. This alert is triggered when the desired replicas of the HPA are equal to the minimum replicas for more than 50% of the time in the past 24 hours.

Impact #

Underutilization of resources (e.g., CPU, memory) allocated to the pods
Potential cost savings by right-sizing the resources
Inefficient use of cluster resources, which can lead to resource shortages for other applications

Diagnosis #

To diagnose the issue, follow these steps:

Check the HPA configuration to ensure that the minimum replicas are set correctly.
Verify that the application is not experiencing high traffic or usage patterns that require more resources.
Investigate the pod utilization metrics (e.g., CPU, memory) to determine if the pods are indeed underutilized.
Check for any recent changes to the application or deployment that may have caused the underutilization.

Mitigation #

To mitigate the issue, follow these steps:

Review and adjust the HPA configuration to set a more optimal minimum replica count.
Consider implementing a more dynamic scaling strategy based on resource utilization metrics.
Right-size the resources allocated to the pods to match the actual usage patterns.
Consider implementing cost-saving measures, such as resource pooling or bin packing, to optimize resource utilization.
Monitor the HPA and pod utilization metrics regularly to detect any future underutilization or overutilization issues.