How Alert Fatigue Destroys MTTR (and Burns Your Error Budget)
Updated April 2026 | Sources: DORA 2024 Accelerate State of DevOps, Honeycomb 2024 Observability Maturity Survey, Atlassian MTTR Benchmarks
What MTTA and MTTR Mean
How long it takes an on-call engineer to confirm they are investigating an alert. Industry median: 8-15 minutes. Healthy: under 5 minutes. Alert fatigue degrades MTTA first.
Total time from incident detection to service restoration. DORA 2024 elite: under 1 hour. Low performers: over 1 week. Alert fatigue is one of the top three contributors to long MTTR.
How long from an incident starting to its first alert triggering. Alert fatigue indirectly degrades MTTD: fatigued teams silence noisy monitors, reducing coverage.
99.9% SLO = 43 minutes/month. 99.95% = 21.9 minutes/month. Alert fatigue burns error budgets faster via longer detection and resolution gaps.
How Noise Degrades MTTA and MTTR
As false-positive ratio increases above 60%, engineers begin pattern-matching alerts to past false positives before investigating. This adds 2-5 minutes to MTTA per page as they mentally categorise the alert before acting. At 80%+ false positive rate, engineers begin acknowledging without investigating until they receive a second page or customer report.
After 6+ months of high page volume, engineers begin acknowledging night pages with a 10-30 minute delay or miss them entirely. The incident.io 2024 survey found 62% report weekly sleep disruption; the natural response is to silence the phone between 11pm and 6am. Real P1s in this window are delayed by hours.
Fatigued engineers conducting simultaneous investigations on multiple concurrent alerts miss the correlating signals that indicate a common root cause. Investigation proceeds on each alert independently. MTTR degrades because root-cause identification takes longer when the engineer cannot see the pattern across 15 simultaneous pages.
Fatigued teams skip post-mortems or conduct cursory ones. Root causes recur. Without a post-mortem culture, alert rules are never updated based on incident learnings, the false-positive ratio continues to climb, and MTTR degrades with each cycle.
DORA 2024 MTTR Benchmarks
The DORA State of DevOps 2024 report classifies organisations into four performance tiers. The gap between elite and low performers is now more than 2,100x on MTTR.
| Tier | MTTR target | Alert discipline typically observed | SLO adoption |
|---|---|---|---|
| Elite | < 1 hour | SLO-based alerting, weekly audits, correlation enabled, runbooks for every P1 | High |
| High | < 1 day | Partial SLO adoption, severity tiers defined, deduplication enabled | Medium |
| Medium | 1 day - 1 week | Threshold alerting, some runbooks, ad-hoc audits | Low |
| Low | > 1 week | All-threshold alerting, no runbooks, alert debt accumulating | None |
Source: DORA 2024 Accelerate State of DevOps Report (dora.dev). Note: DORA does not directly measure alert discipline; the correlation is inferred from MTTR and incident management maturity dimensions.