Reliability operations

Incident command center

Coordinate active incidents with service health, owners, customer impact, communication review, and timeline evidence in one workspace.

SEV-2 · Mitigating

Elevated checkout API errors

Checkout creation intermittently returns 502 responses after the edge gateway deploy.

Duration

42 min

Region

US + EU

Services

2

Commander

Maya Chen

Deputy: Sam Patel

Customer impact

128 accounts

Impacted region: US + EU

Task readiness

1 of 4 tasks complete

Affected services

Current component health from monitoring and incident annotations.

Checkout API

Linked to monitor and runbook

Degraded · 7.8% errors · down

Edge gateway

Linked to monitor and runbook

Degraded · P95 2.4s · down

Billing worker

Linked to monitor and runbook

Operational · 0.2% errors · flat

Tasks

Roll back edge gateway release

API on-call

Done · Now

Verify checkout creation from EU test tenant

QA lead

In progress · 8 min

Prepare enterprise customer note

Support lead

Waiting · 15 min

Open postmortem draft

Incident deputy

Waiting · After mitigation