ProxmoxBackupSnapshotOutdated #
Last snapshot of vm {{ $labels.vm_name }} is older than 2 days.
Alert Rule
alert: ProxmoxBackupSnapshotOutdated
annotations:
  description: Last snapshot of vm {{ $labels.vm_name }} is older than 2 days.
  runbook: https://srerun.github.io/prometheus-alerts/runbooks/pbs-exporter/proxmoxbackupsnapshotoutdated/
  summary: Last snapshot of vm is older than 2 days
expr: (time() - avg_over_time(pbs_snapshot_vm_last_timestamp[5m])) / 3600 / 24 > 2
for: 2m
labels:
  severity: warning
Meaning #
The `ProxmoxBackupSnapshotOutdated` alert fires when the most recent snapshot of a virtual machine (VM) is more than 2 days old, based on the 5-minute average of `pbs_snapshot_vm_last_timestamp`, and the condition has held for 2 minutes. Although its severity is warning, the alert deserves prompt attention: it may indicate that the backup process is failing, which can lead to data loss if the system fails or data is corrupted before a new snapshot is taken.
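To make the threshold concrete, the arithmetic in the rule's `expr` can be reproduced outside Prometheus. The following is a minimal Python sketch of that calculation; the function names and the sample timestamps are illustrative assumptions and are not part of the exporter or of Prometheus.

```python
import time

SECONDS_PER_DAY = 3600 * 24
THRESHOLD_DAYS = 2  # same threshold as the alert expression


def snapshot_age_days(last_snapshot_ts, now=None):
    """Return the snapshot age in days, mirroring (time() - ts) / 3600 / 24."""
    if now is None:
        now = time.time()
    return (now - last_snapshot_ts) / SECONDS_PER_DAY


def is_outdated(last_snapshot_ts, now=None):
    """Return True when the snapshot age exceeds the alert threshold."""
    return snapshot_age_days(last_snapshot_ts, now) > THRESHOLD_DAYS


if __name__ == "__main__":
    now = 1_700_000_000                 # hypothetical "current" Unix time
    fresh = now - 6 * 3600              # snapshot taken 6 hours ago
    stale = now - 3 * SECONDS_PER_DAY   # snapshot taken 3 days ago
    print(is_outdated(fresh, now))  # False
    print(is_outdated(stale, now))  # True
```

Note that the comparison is strict: a snapshot that is exactly 2 days old does not yet satisfy the expression.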
Diagnosis #
To diagnose the issue, follow these steps:
- Check the Prometheus query avg_over_time(pbs_snapshot_vm_last_timestamp[5m]) to verify the timestamp of the last snapshot.
- Investigate the Proxmox backup logs to identify any errors or issues with the backup process.
- Verify the VM's configuration and ensure that it is correctly set up for backups.
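The first diagnostic step can also be scripted against the Prometheus HTTP API (`/api/v1/query`). The sketch below is one possible approach, not a definitive tool: the Prometheus address `http://localhost:9090` is an assumption, the helper names `parse_ages` and `fetch_ages` are hypothetical, and the `vm_name` label is taken from the alert's annotation template.

```python
import json
import urllib.parse
import urllib.request

# Assumption: Prometheus is reachable at this address; adjust for your setup.
PROMETHEUS_URL = "http://localhost:9090"
# Snapshot age in days for every series of the metric used by the alert.
QUERY = "(time() - pbs_snapshot_vm_last_timestamp) / 3600 / 24"


def parse_ages(response):
    """Extract {vm_name: age_in_days} from an instant-query response body."""
    if response.get("status") != "success":
        raise ValueError(f"query failed: {response.get('error', 'unknown error')}")
    ages = {}
    for series in response["data"]["result"]:
        vm = series["metric"].get("vm_name", "<unknown>")
        # Instant-vector samples have the form [unix_time, "value-as-string"].
        age = float(series["value"][1])
        # If several series share a vm_name, keep the oldest age.
        ages[vm] = max(age, ages.get(vm, age))
    return ages


def fetch_ages(base_url=PROMETHEUS_URL, query=QUERY, timeout=10):
    """Run the instant query and return the parsed ages."""
    url = f"{base_url}/api/v1/query?" + urllib.parse.urlencode({"query": query})
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return parse_ages(json.load(resp))
```

Calling `fetch_ages()` returns a dictionary of ages keyed by VM name; any entry above 2 corresponds to a VM that satisfies the alert condition. Keeping the parsing in `parse_ages` separate from the network call makes it possible to check the logic against a saved response.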
Mitigation #
To mitigate the issue, follow these steps:
- Verify the Proxmox backup job is running successfully and scheduled correctly.
- Take an immediate snapshot of the affected machine(s) so that a current recovery point exists.
- Check and resolve any errors or issues found in the backup logs.
- Verify that the snapshot completed successfully.
- Consider adjusting the backup schedule to ensure more frequent snapshots are taken.
- Consider implementing additional monitoring and alerting for the backup process to detect potential issues earlier.
Remember to update the runbook with any additional information or steps specific to your environment.