Building Outage-Proof Observability with AI
Details
Event: Outage-Proof Monitoring – Lessons from the AWS us-east-1 Outage
- What Happened: Explore how the us-east-1 outage, one of the biggest AWS outages, exposed weaknesses in traditional monitoring.
- The Solution: Learn how LLMs and fallback mechanisms enabled reliable, outage-resilient observability.
- Key Insights: See how intelligent status-page analysis improves detection when cloud services fail.
- Who Should Attend: SREs, DevOps engineers, and platform engineers building resilient monitoring systems.
