ProxmoxBackupSnapshotVerifyFailed #
Last verified snapshot of vm {{ $labels.vm_name }} is older than 2 days.
Alert Rule
alert: ProxmoxBackupSnapshotVerifyFailed
annotations:
  description: Last verified snapshot of vm {{ $labels.vm_name }} is older than 2
    days.
  runbook: https://srerun.github.io/prometheus-alerts/runbooks/pbs-exporter/proxmoxbackupsnapshotverifyfailed/
  summary: Last verified Proxmox snapshot older than 2 days
expr: sum by (vm_id, vm_name) (max_over_time(pbs_snapshot_vm_last_verify[2d]) and
  pbs_snapshot_vm_count > 1) == 0
for: 2m
labels:
  severity: warning
Meaning #
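The ProxmoxBackupSnapshotVerifyFailed alert is triggered when the last verified snapshot of a Proxmox virtual machine (VM) is more than 2 days old. In terms of the rule expression, it fires when pbs_snapshot_vm_last_verify has not risen above 0 at any point in the past 2 days for a VM with more than one snapshot, and that condition has held for 2 minutes. This indicates that backup verification for the VM has failed or has not run recently.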
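To make the rule expression concrete, here is a minimal Python sketch of the same logic applied to in-memory sample data. It assumes pbs_snapshot_vm_last_verify is a gauge that reads 1 when the latest snapshot is verified and 0 otherwise, which the `== 0` comparison suggests but which should be confirmed against the exporter's documentation. The `VmSeries` type and `firing_vms` function are hypothetical names used only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class VmSeries:
    """One series as the rule sees it (hypothetical container for illustration)."""
    vm_id: str
    vm_name: str
    last_verify_samples: List[float]  # pbs_snapshot_vm_last_verify samples in the 2d window
    snapshot_count: float             # current pbs_snapshot_vm_count


def firing_vms(series: List[VmSeries]) -> List[Tuple[str, str]]:
    """Return the (vm_id, vm_name) groups for which the expression would be true."""
    totals: Dict[Tuple[str, str], float] = {}
    for s in series:
        # max_over_time yields no result when the window holds no samples.
        if not s.last_verify_samples:
            continue
        # "and pbs_snapshot_vm_count > 1" keeps only VMs with more than one snapshot.
        if not s.snapshot_count > 1:
            continue
        key = (s.vm_id, s.vm_name)
        # sum by (vm_id, vm_name) over the per-series maxima.
        totals[key] = totals.get(key, 0.0) + max(s.last_verify_samples)
    # "== 0" keeps groups that never reported a verified state in the window.
    return sorted(key for key, total in totals.items() if total == 0)
```

In this sketch a VM that reported a verified state even once within the window does not fire, and a VM with a single snapshot is ignored entirely. The `for: 2m` hold time is not modelled.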
Impact #
This alert signals a potential loss of data if the VM fails or its data becomes corrupted. When the last verified snapshot is older than 2 days, the backup system has not recently confirmed the integrity of the VM’s backup data, so a restore may not return the data to a consistent state.
Diagnosis #
To diagnose the issue, follow these steps:
- Check the Proxmox Backup Server (PBS) logs for the affected VM to identify any errors or issues.
- Verify that the PBS is configured correctly and the VM is properly registered.
- Check the PBS dashboard to confirm that the VM is being backed up regularly and that a verification job is scheduled for its datastore.
- Verify that the network connection between the PBS and the Proxmox cluster is stable and not experiencing any connectivity problems.
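Alongside the checks above, it can help to look at the raw metrics behind the alert. The following is a minimal sketch that queries the Prometheus HTTP API (`/api/v1/query`) for the two metrics used in the rule. The base URL, the VM name, and the helper names `build_query_url` and `fetch_metric` are assumptions for illustration and need adapting to your environment.

```python
import json
import urllib.parse
import urllib.request


def build_query_url(base_url: str, metric: str, vm_name: str) -> str:
    """Build an instant-query URL for one metric, filtered by the vm_name label."""
    # Escape backslashes and double quotes so the label value stays valid PromQL.
    escaped = vm_name.replace("\\", "\\\\").replace('"', '\\"')
    promql = f'{metric}{{vm_name="{escaped}"}}'
    query_string = urllib.parse.urlencode({"query": promql})
    return f"{base_url.rstrip('/')}/api/v1/query?{query_string}"


def fetch_metric(base_url: str, metric: str, vm_name: str) -> list:
    """Run the instant query and return the list of result series."""
    url = build_query_url(base_url, metric, vm_name)
    with urllib.request.urlopen(url, timeout=10) as response:
        payload = json.load(response)
    return payload["data"]["result"]


if __name__ == "__main__":
    # Assumed Prometheus address and VM name; replace with real values.
    for name in ("pbs_snapshot_vm_last_verify", "pbs_snapshot_vm_count"):
        print(name, fetch_metric("http://localhost:9090", name, "my-vm"))
```

An empty result for either metric suggests that the exporter is not scraping the VM at all, which points to a different problem than a failed verification.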
Mitigation #
To mitigate the issue, follow these steps:
- Immediately trigger a manual backup verification for the VM using the PBS dashboard.
- Verify that the PBS is configured correctly and the VM is properly registered.
- Check the PBS logs for any errors or issues and address them as needed.
- Consider increasing the frequency of backup verifications or implementing additional monitoring to detect potential issues earlier.
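When several snapshots are affected, it may be useful to decide which ones to re-verify first. The sketch below is a hypothetical helper, not part of PBS or the exporter: it assumes you have already collected each snapshot's backup time and last verification time (for example from the PBS dashboard), and it selects those that were never verified or whose verification is older than the 2-day threshold used by the alert.

```python
from datetime import datetime, timedelta
from typing import List, NamedTuple, Optional


class Snapshot(NamedTuple):
    """Hypothetical record of one snapshot, filled in from your own inventory."""
    vm_name: str
    backup_time: datetime
    verified_at: Optional[datetime]  # None if the snapshot was never verified


def needs_verification(
    snapshots: List[Snapshot],
    now: datetime,
    max_age: timedelta = timedelta(days=2),
) -> List[Snapshot]:
    """Return unverified or stale snapshots, newest backup first."""
    cutoff = now - max_age
    stale = [
        s for s in snapshots
        if s.verified_at is None or s.verified_at < cutoff
    ]
    # Newest backups first, since they are the most likely restore candidates.
    return sorted(stale, key=lambda s: s.backup_time, reverse=True)
```

Sorting newest first is a design choice for this sketch; a different order may suit your restore policy better.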
By following this runbook, you should be able to identify and resolve the root cause of the ProxmoxBackupSnapshotVerifyFailed alert, and ensure the integrity and availability of your Proxmox VMs.