Subscribe
Share
Share
Embed
This episode we speak with Michael Kehoe, a Staff Site Reliability Engineer at LinkedIn. Topics include: Site Reliability Engineering, building satellites at NASA, LinkedIn’s Chaos Engineering project called Waterbear, using Chaos Engineering to test autoscaling, running Chaos Engineering experiments as regression tests in a release pipeline, and tips for starting a Chaos Engineering practice at your company.
A podcast about site reliability engineering (SRE); Chaos Engineering; and the people, processes, and tools used to build resilient systems. Sponsored by Gremlin. Find us on Twitter at @BTOPpod.