This month's host will be Oracle at 1501 Fourth Avenue (Century Square Building) in Seattle at the Alki Room on the 3rd floor. Our talks for this event will be:
"Teaching Teams to DevOps – Empowering Service Owners; Lessons from SAP Concur + Oracle Cloud Infrastructure" - Jason Grimes
Digital transformations are hard, and even harder when your role is to empower and teach Service Owners best practices and give them the tools (collaboration, ChatOps, incident management and workflow) they need to do their best work in daily operations. I will dig into the 9 Keys to End-to-End Ownership and will discuss how we teach teams to get better at Ops while creating a sense of community and always learning.
In this talk, I will discuss how we are teaching team-focused DevOps in a world where we are advisors and subject matter experts as we take on the role of empowering our product and services teams. We will focus on how we help teams achieve a minimum viable operation with a focus on reliability, accountability and striving for improved MTTR and MTBF - because in the cloud, reputation *matters*.
Jason is a Sr. Technical Program Manager @ Oracle Cloud Infrastructure focusing operational readiness. Jason has been serving in technical operations since 1999, with 13+ years on call and is an expert in DevOps and SRE.
"Observability is the Key Tenet of Running a Multi-Tenant K8s Environment" - Thom McCann & Amreth Chandrasehar
How do you achieve 200 Million requests per day and no down time for 2 years? T-Mobile has been driving containerized workloads for many internal application teams. Running an internal multi-tenant environment can be challenging but has significant benefits
In this session we'll dive deep into observability components such as a large scale telemetry system built on Prometheus serving 4000+ requests per second and millions of metrics over 6 clusters. This open source system is built across 3 AZs in each region (US West and US East) and is federated across multiple Prometheus clusters enabling distributed queries and limitless scale.
We'll dive deep into how our operational teams can view integrated monitoring for infrastructure, hosts, VMs, containers and application level and integrate alerts with slack, pager duty and other real-time systems.
Interested in hosting or speaking at a future meetup? Reach out to the organizers in the left margin!