

## Building Resilient Systems: Fault Tolerance & Chaos Engineering

### Duration

2 Hours

### Topics

#### Failure as a Design Principle

  • Partial Failure
  • Cascading Failure

#### Reliability Patterns

  • Retry
  • Timeout
  • Exponential Backoff
  • Jitter
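
For readers who want a concrete picture before the session, here is a minimal sketch of how retry, exponential backoff and jitter might fit together. It is not taken from the session materials. The names `backoff_delays` and `retry`, and the defaults for `base` and `cap`, are illustrative assumptions. The sketch also assumes the wrapped operation enforces its own timeout, for example through a client library's timeout parameter.

```python
import random
import time


def backoff_delays(retries, base=0.1, cap=5.0, rng=random.random):
    """Yield one sleep duration per retry: capped exponential backoff with full jitter."""
    for attempt in range(retries):
        # The ceiling doubles on every retry but never exceeds the cap.
        ceiling = min(cap, base * (2 ** attempt))
        # Full jitter: pick a random point between 0 and the ceiling so that
        # many clients retrying at once do not synchronize.
        yield rng() * ceiling


def retry(operation, attempts=4, base=0.1, cap=5.0, sleep=time.sleep):
    """Call operation until it succeeds or the attempts are used up."""
    delays = backoff_delays(attempts - 1, base, cap)
    last_error = None
    for _ in range(attempts):
        try:
            return operation()
        except Exception as exc:  # a real system would catch only retryable errors
            last_error = exc
            delay = next(delays, None)
            if delay is None:
                break  # no retries left
            sleep(delay)
    raise last_error
```

Passing `sleep` and `rng` as parameters is a design choice that keeps the sketch testable without real waiting; production code would usually also cap the total time spent retrying.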

#### Circuit Breaker Pattern

  • Preventing Service Meltdown
  • Recovery Mechanisms
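
The following is a minimal, single-threaded sketch of the circuit breaker idea, offered as an illustration and not as the session's own implementation. The class name `CircuitBreaker`, the `failure_threshold` and `reset_timeout` parameters, and the three state names are assumptions chosen for this example. After enough consecutive failures the breaker opens and rejects calls immediately; once the reset timeout has passed it lets calls through again as a trial, closing on success and re-opening on failure.

```python
import time


class CircuitOpenError(Exception):
    """Raised when the breaker rejects a call without attempting it."""


class CircuitBreaker:
    def __init__(self, failure_threshold=3, reset_timeout=30.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout
        self.clock = clock
        self.failures = 0
        self.opened_at = None  # None means the circuit is closed

    @property
    def state(self):
        if self.opened_at is None:
            return "closed"
        if self.clock() - self.opened_at >= self.reset_timeout:
            return "half-open"  # timeout elapsed: trial calls are allowed
        return "open"

    def call(self, operation):
        if self.state == "open":
            # Fail fast instead of piling load onto a struggling dependency.
            raise CircuitOpenError("dependency marked unhealthy")
        try:
            result = operation()
        except Exception:
            self._record_failure()
            raise
        # Any success closes the circuit and clears the failure count.
        self.failures = 0
        self.opened_at = None
        return result

    def _record_failure(self):
        self.failures += 1
        # A failed trial call in the half-open state re-opens immediately;
        # otherwise the circuit opens once the threshold is reached.
        if self.opened_at is not None or self.failures >= self.failure_threshold:
            self.opened_at = self.clock()
```

This sketch does not limit how many trial calls pass in the half-open state and is not thread-safe; a production breaker would typically add both.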

#### Chaos Engineering

  • Why Netflix Created Chaos Monkey
  • Failure Injection

#### Real Production Incidents

  • AWS Outages
  • Database Failures
  • Service Dependency Failures

### Outcome

Participants will:

  • Design self-healing systems
  • Handle failures gracefully
  • Reduce production incidents
  • Apply SRE reliability concepts

Join Zoom Meeting

[https://us02web.zoom.us/j/83228631125?pwd=eJERanOWJ4Dp95q0BhbAj5Ow2EsDaf.1](https://us02web.zoom.us/j/83228631125?pwd=eJERanOWJ4Dp95q0BhbAj5Ow2EsDaf.1)

Meeting chat link:
[https://us02web.zoom.us/launch/jc/83228631125](https://us02web.zoom.us/launch/jc/83228631125)

Meeting ID: 832 2863 1125
Passcode: 822524

Related topics

  • Cloud Computing
  • Distributed Systems
  • Golang
  • Python
  • DevOps
