142 ms
12 ms vs 1h ago
SLA ceiling: 200ms · 29% headroom
Are all services healthy and within SLA right now?
ms · p50 / p95 / p99 · last 60 min · red dashed = 200ms SLA
% 5xx errors · last 60 min
RPS · last 60 min · dashed line = 60-min avg
Open incidents sorted by severity · assigned on-call
| Incident | Service | Severity | Duration | On-call |
|---|---|---|---|---|
| DB latency spike | Database | SEV-2 | 24 min | A. García |
| Memory pressure | Worker pool | SEV-3 | 1h 12m | J. Kowalski |
| Slow query backlog | Database | SEV-3 | 38 min | A. García |
| CDN cache miss rate | CDN Edge | SEV-4 | 2h 04m | M. Chen |
Latency percentiles are from Datadog APM (p50/p95/p99 across all API gateway requests). The 200 ms SLA applies to p99. Error rate = HTTP 5xx responses ÷ total requests. Uptime = availability of the API gateway endpoint per external monitors. All figures are synthetic and for illustration only.
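The definitions above can be sketched in a few lines of Python. This is a minimal illustration, not how Datadog APM computes its figures: the function names are hypothetical, the nearest-rank percentile method is an assumption, and the sketch assumes the headline 142 ms value is the p99 that the SLA is measured against.

```python
from math import ceil

SLA_MS = 200  # SLA ceiling shown on the dashboard; applies to p99


def percentile(samples, pct):
    """Nearest-rank percentile (assumed method; the real APM backend may differ)."""
    if not samples:
        raise ValueError("no latency samples")
    ordered = sorted(samples)
    # Nearest-rank: the smallest value with at least pct% of samples at or below it.
    rank = max(1, ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def headroom_pct(p99_ms, sla_ms=SLA_MS):
    """Share of the SLA ceiling still unused by the current p99 latency."""
    return (sla_ms - p99_ms) * 100 / sla_ms


def error_rate_pct(count_5xx, total_requests):
    """HTTP 5xx responses divided by total requests, as a percentage."""
    if total_requests == 0:
        return 0.0  # no traffic: report zero rather than divide by zero
    return count_5xx * 100 / total_requests


if __name__ == "__main__":
    # 142 ms against a 200 ms ceiling leaves (200 - 142) / 200 = 29% headroom.
    print(headroom_pct(142))
```

With the synthetic headline value, `headroom_pct(142)` gives 29.0, matching the "29% headroom" shown next to the SLA ceiling.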