In the course of #opslife, we run into production incidents. How do we best manage them to avoid 3am misery? Matt Stratton of PagerDuty joins us to talk about just that.
“Is there any strong objection?” - How to have a (relatively) stress free time during an outage.
Major outages, incident calls, war rooms, whatever you want to label them, can be stressful and frustrating experiences. However, we aren't the only industry to have run into these problems. What can we learn from others on how to have a relatively stress free experience? How can we shorten the time that it takes to get back to a working state when things are broken?
This talk will provide some comparisons to responses in other industries, and then go through several patterns and processes any team or company can use to have a quick, visible, and easy time responding to problems.
Matt Stratton is a DevOps Evangelist at PagerDuty, where he help dev and ops teams advance the practice of their craft and become more operationally mature. He is the founder and co-host of the popular Arrested DevOps podcast, he collaborates with PagerDuty customers and industry thought leaders in the broader DevOps community, and his license plate actually says “DevOps”.
Bonus short talk! After the main talk, we'll hear from Ben Kochie about the Prometheus Monitoring System; he'll provide a short introduction to Prometheus, a popular open source monitoring system.
Ben Kochie is a Minnesota native now living in Germany. After leaving Minnesota, he spent many years as a SRE / Systems Engineer at Google and SoundCloud. He now leads the monitoring product team at GItLab.
Dinner & drinks will be served. SPS Commerce is our venue host; food and drink sponsorship provided by Diamanti (www.diamanti.com).
6pm: Doors open
6:30pm: Welcome, sponsor, and Matt Stratton
7:45pm: Ben Kochie