Break Things on Purpose

Is it better to build a system that fails all the time, but recovers gracefully, or one that rarely fails, but when it does, it's catastrophic? Jérôme Petazzoni (tinkerer extraordinaire and container technology educator) joins the pod to share an incident from the very early days of Docker (dotCloud) and how to apply the lessons learned when building with tools like Kubernetes.

Show Notes

Podcast Twitter: https://twitter.com/BTOPPod
Podcast email: podcast@gremlin.com
Jérôme's Twitter: https://twitter.com/jpetazzo

Episode Highlights:
  • Distributed databases at dotCloud & avoiding a major outage (2:18)
  • Multilayered Kubernetes lasagna (16:06)
  • Empowering others & what's important (24:22)
Episode transcript: https://www.gremlin.com/blog/podcast-break-things-on-purpose-jerome-petazzoni-tinkerer-and-container-technology-educator

What is Break Things on Purpose?

A podcast about site reliability engineering (SRE); Chaos Engineering; and the people, processes, and tools used to build resilient systems. Sponsored by Gremlin. Find us on Twitter at @BTOPpod.