Details

This event is sponsored by PagerDuty

Hey everyone!

We're back with another edition of the Observability Engineering London meetup. This time, we're exploring the world of SLOs at scale.

Join us on Thursday, July 11th, when Alex, Senior Site Reliability Engineer at Google, will present the evolution of Service Level Objectives (SLOs) for the GCE Compute API over the past eight years. He'll start with the initial 30 SLOs, move through a phase with around a thousand, and end with millions of per-customer SLOs. He'll share anecdotes, techniques for handling low-QPS traffic (continuous over discrete metrics), and strategies for aggregating data to give leadership better visibility. He'll also give practical tips for running and improving this system in production.

👾 Gameplan:
6:00 Food and drinks
6:15 Welcome
6:30 Alex Palcuie | Going from 30 to 30 Million SLOs
7:00 Break | networking
7:10 Community discussion
8:00 Networking
8:30 Wrap up and head to the pub downstairs to keep the conversation going.

👋 Connect with us

See you all there!
Karim
----------------
A Bit About Alex Palcuie | Alex has been a Site Reliability Engineer at the organization managing the GCE Compute API for over 7 years. He's had his hands in nearly every aspect of the control plane, from rollouts and observability to disaster recovery. He's also a member of the Tech Incident Response Team (Tech-IRT), handling incidents such as powering down data centers due to water leaks or scrambling for capacity during Black Friday.

Talk | Going from 30 to 30 Million SLOs
Alex will present the evolution of Service Level Objectives (SLOs) for the GCE Compute API over the past eight years. He'll start with the initial 30 SLOs, move through a phase with around a thousand, and end with millions of per-customer SLOs. He'll share anecdotes, techniques for handling low-QPS traffic (continuous over discrete metrics), and strategies for aggregating data to give leadership better visibility. He'll also give practical tips for running and improving this system in production.

You can connect with Alex on LinkedIn.
----------------

If you would like to give a presentation at one of our events, please DM Karim.
