CassandraClientRequestWriteFailure

CassandraClientRequestWriteFailure #

A lot of write failures encountered. A write failure is a non-timeout exception encountered during a write request. Examine the reason map to find to the root cause. The most common cause for this type of error is when batch sizes are too large.

Alert Rule
alert: CassandraClientRequestWriteFailure
annotations:
  description: |-
    A lot of write failures encountered. A write failure is a non-timeout exception encountered during a write request. Examine the reason map to find to the root cause. The most common cause for this type of error is when batch sizes are too large.
      VALUE = {{ $value }}
      LABELS = {{ $labels }}    
  runbook: https://srerun.github.io/prometheus-alerts/runbooks/criteo-cassandra-exporter/cassandraclientrequestwritefailure/
  summary: Cassandra client request write failure (instance {{ $labels.instance }})
expr: increase(cassandra_stats{name="org:apache:cassandra:metrics:clientrequest:write:failures:oneminuterate"}[1m])
  > 0
for: 0m
labels:
  severity: critical

Meaning #

Impact #

Diagnosis #

Mitigation #