Disruption & Resilience
Small moves with outsized effects can help or hurt. Build graceful-degradation, buffers, and response drills.
Healthy uses
- Scenario libraries with triggers.
- Buffers for time, compute, inventory.
- Cross-training and role redundancy.
Risks to avoid
- Single points of failure.
- No drills; paper-only plans.
- Excessive coupling across systems.
Controls & guardrails
- Error budgets and brownout plans.
- Chaos/game days by quarter.
- Runbooks with time-boxed steps.
- Dependency maps with blast-radius limits.
Signals & metrics
- MTTD/MTTR
- Drill completion rate
- Dependency concentration
Tech hooks
Observability & Reliability (SLOs, chaos), DevOps (autoscale, brownouts), Cloud (multi-AZ), Supply Chain (alt corridors).
Related playbooks
Point A → Point B, Fast
Architecture Sprint (1–2 weeks): Current → Target → 1–90-day plan. D1 maps, D2 target, D3 plan, D4 exec brief.