Heuristics

Prioritise Recoverability Over Flawless Deployment

Authors: Anne-Marie Charrett

Submitter: Anne-Marie Charrett

In modern distributed systems, where should we focus our efforts: defect prevention or incident recovery?

Shift focus from attempting to eliminate every bug before deployment to ensuring rapid recovery from failure in production. The cost of a quickly remediated production incident is often lower than the cost of maintaining complex, slow, and brittle pre-production environments. Invest heavily in observability, monitoring, and fast rollback capabilities.

Example

We accept the risk of minor bugs reaching production, provided we can detect and roll back a change in under five minutes. This is more efficient than maintaining a costly and slow system integration test environment.
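
One way to make the five-minute target checkable is sketched below. It is only an illustration, not part of the original heuristic: the names `HealthSample`, `first_breach`, and `recovery_within_budget`, the error-rate threshold, and the idea of sampling error rate after a deploy are all assumptions introduced here.

```python
from dataclasses import dataclass
from typing import Iterable, Optional

# The "under five minutes" target from the example, in seconds.
RECOVERY_BUDGET_SECONDS = 5 * 60


@dataclass
class HealthSample:
    """Hypothetical post-deploy measurement taken by a monitoring system."""
    seconds_since_deploy: float
    error_rate: float  # fraction of failed requests, 0.0 to 1.0


def first_breach(samples: Iterable[HealthSample],
                 max_error_rate: float) -> Optional[HealthSample]:
    """Return the first sample whose error rate exceeds the threshold, or None."""
    for sample in samples:
        if sample.error_rate > max_error_rate:
            return sample
    return None


def recovery_within_budget(samples: Iterable[HealthSample],
                           max_error_rate: float,
                           rollback_seconds: float) -> bool:
    """True if no breach occurred, or detection plus rollback fits the budget."""
    breach = first_breach(samples, max_error_rate)
    if breach is None:
        return True
    return breach.seconds_since_deploy + rollback_seconds <= RECOVERY_BUDGET_SECONDS
```

A team could run a check like this against data from a deliberately broken test release to see whether detection time plus rollback time really fits inside the five-minute budget before relying on it.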

 

