NetdataLowDiskSpace #
Netdata low disk space (> 80%)
Alert Rule
alert: NetdataLowDiskSpace
annotations:
description: |-
Netdata low disk space (> 80%)
VALUE = {{ $value }}
LABELS = {{ $labels }}
runbook: https://srerun.github.io/prometheus-alerts/runbooks/netdata-internal/netdatalowdiskspace/
summary: Netdata low disk space (instance {{ $labels.instance }})
expr: 100 / netdata_disk_space_GB_average * netdata_disk_space_GB_average{dimension=~"avail|cached"}
< 20
for: 5m
labels:
severity: warning
Here is a sample runbook for the NetdataLowDiskSpace alert:
Meaning #
The NetdataLowDiskSpace alert is triggered when the available disk space on a Netdata instance falls below 20%. This means that the disk space is critically low, and immediate action is required to prevent data loss or system crashes.
Impact #
The impact of low disk space on a Netdata instance can be severe:
- Data loss: If the disk space is completely depleted, Netdata may not be able to write new data, leading to data loss.
- System crashes: Low disk space can cause system crashes or freezes, leading to downtime and disruption to monitoring and alerting capabilities.
- Performance degradation: Low disk space can also cause performance degradation, leading to slower response times and decreased system performance.
Diagnosis #
To diagnose the cause of the low disk space, follow these steps:
- Check the disk usage: Run the command
df -h
to check the disk usage on the Netdata instance. - Identify the cause: Check the output of the
df -h
command to identify the cause of the low disk space. Common causes include:- Log file growth: Check the log file sizes and rotate or delete unnecessary logs.
- Data file growth: Check the sizes of data files and consider archiving or deleting unnecessary data.
- Software updates: Check if any software updates have caused the disk space to decrease.
- Check the Netdata configuration: Check the Netdata configuration to ensure that it is set up correctly and that disk space is being monitored correctly.
Mitigation #
To mitigate the low disk space issue, follow these steps:
- Free up disk space: Immediately free up disk space by deleting unnecessary files, logs, and data.
- Configure log rotation: Configure log rotation to ensure that log files are regularly rotated and deleted.
- Configure data retention: Configure data retention policies to ensure that unnecessary data is deleted or archived.
- Increase disk space: Consider increasing the disk space on the Netdata instance or moving the data to a larger storage device.
- Monitor disk space: Closely monitor the disk space on the Netdata instance to prevent similar issues in the future.