
Details

In this series, guests talk about their production outages, incident response, and postmortem analysis, as well as the lessons learnt from the production disruption.

Presenter: Marcel D. Juhnke (SRE @karrieretutor)

This talk covers a traffic outage that occurred while migrating workloads from one node pool to another. Such a migration should not affect traffic, but it still cut off our Ingress controllers while a Pod migration was stuck due to disruption budgets.

Zoom: https://zoom.us/j/264733982
Slack: https://devopsonline.herokuapp.com/

We meet every two weeks.
