MongodbReplicaMemberUnhealthy #
MongoDB replica member is not healthy
Alert Rule
alert: MongodbReplicaMemberUnhealthy
annotations:
description: |-
MongoDB replica member is not healthy
VALUE = {{ $value }}
LABELS = {{ $labels }}
runbook: https://srerun.github.io/prometheus-alerts/runbooks/percona-mongodb-exporter/mongodbreplicamemberunhealthy/
summary: Mongodb replica member unhealthy (instance {{ $labels.instance }})
expr: mongodb_rs_members_health == 0
for: 0m
labels:
severity: critical
Here is a runbook for the Prometheus alert rule MongodbReplicaMemberUnhealthy
:
Meaning #
The MongodbReplicaMemberUnhealthy
alert is triggered when a MongoDB replica member is not healthy. This means that the replica member is not able to communicate with the primary node or other replica members, which can lead to data inconsistencies and impact the overall performance of the MongoDB cluster.
Impact #
The impact of an unhealthy MongoDB replica member can be significant, as it can lead to:
- Data inconsistencies and potential data loss
- Reduced performance and availability of the MongoDB cluster
- Increased latency and errors for applications relying on the MongoDB cluster
- Potential for secondary effects on dependent systems and services
Diagnosis #
To diagnose the issue, follow these steps:
- Check the MongoDB replica member’s logs for errors or warnings related to connectivity or replication.
- Verify the replica member’s status using the
mongo
shell or the Percona MongoDB Exporter. - Check the network connectivity between the replica member and the primary node.
- Validate the replica member’s configuration and ensure it matches the expected configuration.
Mitigation #
To mitigate the issue, follow these steps:
- Restart the MongoDB replica member service to attempt to re-establish connectivity.
- Check and repair any disk errors or corruption on the replica member.
- Verify the replica member’s configuration and update it if necessary.
- If the issue persists, consider re-syncing the replica member with the primary node.
- If the issue is related to network connectivity, investigate and resolve any underlying network issues.
Remember to refer to the Percona MongoDB Exporter documentation for additional guidance and best practices for troubleshooting and resolving MongoDB replica member health issues.