ProxmoxBackupHostOutOfMemory #
Node memory is filling up (< 10% left)
Alert Rule
alert: ProxmoxBackupHostOutOfMemory
annotations:
description: |-
Node memory is filling up (< 10% left)
VALUE = {{ $value }}
LABELS = {{ $labels }}
runbook: https://srerun.github.io/prometheus-alerts/runbooks/pbs-exporter/proxmoxbackuphostoutofmemory/
summary: Proxmox host out of memory (instance {{ $labels.instance }})
expr: pbs_host_memory_used / (pbs_host_memory_used + pbs_host_memory_free) * 100 >
90
for: 2m
labels:
severity: warning
Here is a runbook for the Prometheus alert rule ProxmoxBackupHostOutOfMemory
:
Meaning #
The ProxmoxBackupHostOutOfMemory
alert indicates that a Proxmox host is experiencing low memory availability. Specifically, the alert is triggered when the percentage of used memory exceeds 90% for 2 minutes or more. This can lead to performance issues, slow backups, and even crashes.
Impact #
The impact of this alert can be significant, as it may:
- Cause backups to fail or slow down, resulting in data loss or inconsistencies
- Lead to performance issues on the Proxmox host, and its dependent services
- Potentially crash the Proxmox host, leading to downtime and service unavailability
Diagnosis #
To diagnose the issue, follow these steps:
- Check the Proxmox host’s resource utilization (CPU, memory, disk usage) to identify the cause of the memory shortage.
- Verify that there are no resource-intensive processes or applications running on the host.
- Check the backup configuration and adjust it if necessary to reduce memory usage.
- Consider adding more memory to the Proxmox host or migrating to a host with more resources.
Mitigation #
To mitigate the issue, follow these steps:
- Immediately investigate and address the cause of the memory shortage.
- Implement a memory allocation strategy to ensure that critical services receive sufficient memory.
- Consider setting up memory reservations or limits for critical applications.
- Monitor the Proxmox host’s memory usage closely to prevent similar issues in the future.
Note: This runbook is meant to serve as a starting point and might need to be adapted to your specific environment and setup.