2:14:00 AMDatadog alert: "postgres.connection_pool.utilization > 95%"
2:14:15 AMPagerDuty receives alert
2:14:30 AMPagerDuty evaluates escalation policy
2:14:45 AMSMS + phone call to Marcus (on-call)
2:15:30 AMMarcus’s phone rings (asleep, startled awake)
2:16:00 AMMarcus acknowledges alert in PagerDuty app
2:18:00 AMMarcus opens laptop, VPNs in
2:22:00 AMMarcus SSHs to database server
2:28:00 AMDiagnoses: 89 idle connections leaked
2:35:00 AMExecutes: SELECT pg_terminate_backend(...)
2:38:00 AMVerifies: Pool healthy (9/95 connections)
2:42:00 AMMarks resolved in PagerDuty
2:45:00 AMTries to sleep (adrenaline still high)
4:30:00 AMFinally falls back asleep
7:00:00 AMAlarm (3 hours sleep total)
Marcus: 2.5 hours sleep lost. 40% productivity next day.
PagerDuty routed the alert reliably. Marcus still woke up, still manually fixed, still lost sleep.