Skip to content
Cloud London Meetup #7

Details

Folks,

Our Next virtual event is scheduled for May 6 2021 .

Sponsor : GResearch

Speakers

17:30 - 18:30 -Sal Kimmich - Developer Advocate - Reliably

Topic: SRE in Cloud - From Error Budgets to SLO and SLI Automation

There's a major culture shift going on in tech today: both the software infrastructure and management/operations of people have become increasingly distributed and automated. Those who's been in tech in the last decade are likely very familiar with the concept of DevOps, defined as a set of practices that combines software development (Dev) and IT operations (Ops). The new field of Site Reliability Engineering was introduced in the last decade. Defined by Ben Treynor, founder of Google's Site Reliability Team, as "what happens when a software engineer is tasked with what used to be called operations", this new way of developing with reliability first changes the way we think about building software fundamentally.

In this talk we'll cover two important things:

  1. Handling incidents in post production is dangerous and costly, so shifting to an error budget can change the way you fundamentally engineer and monitor your system
  2. Micro-service architectures are complex, learn to automate the way you handle increasingly micro-service structures and their distributed dependencies with open source tooling

In the most basic definition, error budgets are simply the amount of error that a service can accumulate over a specified period of time before users grumble about the experience. We will cover common combinations of SLIs that lead to error budget best practices, as well as protocols that can be enacted when error budgets slip: the who, what, and when and why of pre-incident reporting..

Bio:
Sal Kimmich is the Developer Advocate for Reliably, the leading SRE automation tool. Sal is passionate about evolving the best practices of site reliability engineering, distributed computing and tracing. They care about the human-centered management of data-driven systems, helping people build data ecosystems that make sense, and solving hard problems through the clever use of math.

Many thanks
Cloud London.

Note: Meeting Details will be updated soon.

Photo of Cloud London group
Cloud London
See more events
Online event
This event has passed