Let's play the Reliability Engineering Game


Details
Where do you start when you need to improve the reliability of your systems? How do you engineer a reliable system?
In this meetup, we will interleave theory and practice to give you a better understanding of Site Reliability Engineering (SRE). We will look at managing risk, designing your monitoring, choosing patterns that improve the resilience of your systems, and getting Platform, DevOps and SRE teams to work together.
As part of the meetup, we will play a small game that puts you in charge of a fictional systems landscape, one that suffers from low reliability and frequent outages of different kinds. You and your team need to improve availability, stability, agility and performance by selecting the right architectural patterns and DevOps practices.
Agenda:
18:00: Dinner
18:30: Reliability Engineering theory
19:30: Break
19:40: Reliability Engineering Game
21:00: Wrap up & Drinks
