## Building Resilient Systems: Fault Tolerance & Chaos Engineering
### Duration
2 Hours
### Topics
#### Failure as a Design Principle
- Partial Failure
- Cascading Failure
#### Reliability Patterns
- Retry
- Timeout
- Exponential Backoff
- Jitter
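
As a rough illustration of how these four patterns combine, here is a minimal Python sketch. The names `retry` and `backoff_delays` and the defaults (`base`, `cap`, `attempts`) are assumptions made for this example, not part of the session material. It uses the "full jitter" variant, where each delay is drawn uniformly between zero and an exponentially growing ceiling. The wrapped call is expected to enforce its own timeout and raise `TimeoutError`.

```python
import random
import time


def backoff_delays(retries, base=0.1, cap=5.0, rng=random.random):
    """Yield one delay per retry: uniform in [0, min(cap, base * 2**n)]."""
    for n in range(retries):
        ceiling = min(cap, base * (2 ** n))  # exponential growth, capped
        yield rng() * ceiling                # jitter spreads clients apart


def retry(call, attempts=4, base=0.1, cap=5.0,
          retry_on=(ConnectionError, TimeoutError),
          sleep=time.sleep, rng=random.random):
    """Run call(); on a retryable error, wait a jittered delay and try again.

    The last failure is re-raised once `attempts` calls have been made.
    `sleep` and `rng` are injectable so the behaviour can be tested.
    """
    delays = backoff_delays(attempts - 1, base, cap, rng)
    for attempt in range(1, attempts + 1):
        try:
            return call()
        except retry_on:
            if attempt == attempts:
                raise
            sleep(next(delays))
```

Only errors that are plausibly transient are retried here. Retrying non-idempotent operations, or retrying without a cap on attempts, can amplify load on a service that is already struggling.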
#### Circuit Breaker Pattern
- Preventing Service Meltdown
- Recovery Mechanisms
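
The sketch below shows one possible shape for a circuit breaker with the usual closed, open, and half-open states. The class name, the thresholds, and the injectable `clock` are assumptions for illustration. Production libraries typically add features such as rolling failure windows and a limit on concurrent trial calls, which this version omits.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when the breaker rejects a call without trying it."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_timeout=30.0,
                 clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    @property
    def state(self):
        if self.opened_at is None:
            return "closed"
        if self.clock() - self.opened_at >= self.reset_timeout:
            return "half-open"  # timeout elapsed: allow trial calls
        return "open"

    def call(self, fn, *args, **kwargs):
        if self.state == "open":
            # Fail fast instead of piling load onto a failing dependency.
            raise CircuitOpenError("circuit open; failing fast")
        try:
            result = fn(*args, **kwargs)
        except Exception:
            self._record_failure()
            raise
        # Any success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result

    def _record_failure(self):
        self.failures += 1
        # A failed trial call, or too many failures, (re)opens the circuit.
        if self.opened_at is not None or self.failures >= self.failure_threshold:
            self.opened_at = self.clock()
```

Failing fast while the circuit is open is what limits the meltdown: callers stop waiting on a dependency that is unlikely to answer. The half-open trial call is the recovery mechanism.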
#### Chaos Engineering
- Why Netflix Created Chaos Monkey
- Failure Injection
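
Failure injection can also be tried at the application level. The following is a small hypothetical decorator, not how Chaos Monkey itself works (Chaos Monkey terminates running instances). The names `inject_faults` and `InjectedFault`, and all parameters, are assumptions made for this sketch.

```python
import random
import time
from functools import wraps


class InjectedFault(RuntimeError):
    """Marks an error raised deliberately by the experiment."""


def inject_faults(error_rate=0.0, latency_rate=0.0, latency_s=0.0,
                  enabled=lambda: True,
                  rng=random.random, sleep=time.sleep):
    """Wrap a function so that some calls are delayed or fail on purpose.

    `enabled` is a kill switch checked on every call, so an experiment
    can be stopped immediately without redeploying.
    """
    def decorator(fn):
        @wraps(fn)
        def wrapper(*args, **kwargs):
            if enabled():
                if rng() < latency_rate:
                    sleep(latency_s)  # simulate a slow dependency
                if rng() < error_rate:
                    raise InjectedFault(f"injected failure in {fn.__name__}")
            return fn(*args, **kwargs)
        return wrapper
    return decorator
```

Wrapping a client call with, for example, `@inject_faults(error_rate=0.05)` lets a team check whether retries, timeouts, and circuit breakers behave as intended. It is usually wise to start with a small rate and a limited blast radius.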
#### Real Production Incidents
- AWS Outages
- Database Failures
- Service Dependency Failures
### Outcome
Participants will:
- Design self-healing systems
- Handle failures gracefully
- Reduce production incidents
- Apply SRE reliability concepts
Join Zoom Meeting
[https://us02web.zoom.us/j/83228631125?pwd=eJERanOWJ4Dp95q0BhbAj5Ow2EsDaf.1](https://us02web.zoom.us/j/83228631125?pwd=eJERanOWJ4Dp95q0BhbAj5Ow2EsDaf.1)
Meeting chat link
[https://us02web.zoom.us/launch/jc/83228631125](https://us02web.zoom.us/launch/jc/83228631125)
Meeting ID: 832 2863 1125
Passcode: 822524
