August Prometheus meetup

Hosted By
cindy

Details

Agenda

6:30pm - 6:50pm - Doors open, food and networking

6:50pm - 7:30pm - Talk 1 - Monitoring Cloudflare's Planet-Scale Edge Network with Prometheus

7:30pm - 7:40pm - Break

7:40pm - 8:30pm - Talk 2 - istio + Prometheus

8:30pm - 8:45pm - Networking

9:00pm - Doors close

Abstract

Talk 1: Monitoring Cloudflare's Planet-Scale Edge Network with Prometheus - Matt Bostock

Cloudflare operates a global anycast edge network serving content for 6 million websites. This talk explains how we monitor our network, how we migrated from Nagios to Prometheus, and the architecture we chose to provide maximum reliability for monitoring. We'll also discuss the impact of alert fatigue and how we reduced alert noise by analysing data, making alerts more actionable, and alerting on symptoms rather than causes.

This talk will cover:

• The challenges of monitoring a high-volume, anycast edge network across 100+ locations

• The architecture we chose to maximise the reliability of our monitoring

• Why Prometheus excels as the new industry standard for modern monitoring

• Approaches for reducing alert noise and alert fatigue

• Triaging alerts into a ticket system

• Analysing past alert data for continuous improvement

• The pain points we endured

• Effecting change across engineering teams

Bio

Matt is a Platform Operations engineer at Cloudflare, where he has spent the last year promoting a monitoring utopia. He was previously tech lead for the GOV.UK Infrastructure team and is a keen contributor to open source software. He also loves bacon, avocado, running, and the Oxford comma.

Talk 2: istio + Prometheus - Zack Butcher

We introduce Istio, a new service mesh. We’ll describe how Istio generates Prometheus metrics on behalf of your application, how Istio uses Prometheus for its own monitoring, and where we’re heading with metrics in Istio.

Bio

Zack works at Google as one of the core contributors to the Istio project. Prior to working on Istio he worked on a variety of teams in Google Cloud Platform, focusing on authz, policy, data retention, and the internal system Istio is based on.

SF Prometheus Meetup Group
Cloudflare
101 Townsend Street · San Francisco, CA