Managing Chaos at Scale

45:19 210 views 0% Published 2 years ago

This talk by Paweł Królikowski describes how Uber tries to be reliable. With an explosive growth from a few to thousands of microservices there’s a few things they did right, but also a lot of things they had to learn the hard way. He starts with incident prevention: integration testing, load testing, chaos testing, blackbox testing, rollout strategies. He then follows with incident response: on-call, monitoring, alerting, mitigation strategies and touches briefly on benefits of using common frameworks in reliability engineering.

The global dev community meets at WeAreDevelopers, an event dubbed by many as the “Woodstock of Developers”. The WeAreDevelopers World Congress 2018 brought together 8,000 techies from 70 countries for 72-hours of pure dev-fun.

Visit the largest developer playground in Europe!



©2018, WeAreDevelopers

Link Original video