Published Sep 3, 2019
SE-Radio Episode 284: John Allspaw on System Failures: Preventing, Responding, and Learning From
John Allspaw explores the complexities of system failures, emphasizing the significance of designing resilient systems and learning from post-mortem evaluations. He challenges conventional notions of human error, advocating for a nuanced understanding and the essential role of testing within production environments to preempt potential failures.

Topics covered
Popular Clips
Questions from this episode
- Asked by 28 people
- Asked by 2 people
Episode Highlights
Related Episodes


SE-Radio Episode 301: Jason Hand Handling Outages
Answers 383 questions

SE-Radio Episode 325: Tammy Butow on Chaos Engineering
Answers 383 questions

SE Radio 637: Steve Smith on Software Quality
Answers 383 questions

SE Radio 572: Gregory Kapfhammer on Flaky Tests
Answers 383 questions

SE-Radio Episode 242: Dave Thomas on Innovating Legacy Systems
Answers 383 questions

SE-Radio Episode 256: Jay Fields on Working Effectively with Unit Tests
Answers 383 questions

SE-Radio Episode 276: Björn Rabenstein on Site Reliability Engineering
Answers 383 questions

SE-Radio-Episode-280-Gerald-Weinberg-on-Bugs-Errors-and-Software-Quality
Answers 383 questions

SE-Radio Episode 344: Pat Helland on Web Scale
Answers 383 questions
SE-Radio Episode 332: John Doran on Fixing a Broken Development Process
Answers 383 questions

SE-Radio Episode 295: Michael Feathers on Legacy Code
Answers 383 questions

SE-Radio Episode 357: Adam Barr on Code Quality
Answers 383 questions

SE-Radio Episode 247: Andrew Phillips on DevOps
Answers 383 questions













