About the event
- Date: 2024/05/20 (Monday) 18:00 (starts 18:30)
- Location: Didžioji str. 28, Vilnius (Wix Town Hall Office)
- Language: English
Agenda
6:00 p.m. – Welcome to Wix.com: doors open; treat yourself with drinks & snacks
6:30 p.m. – Distributed Tracing using Open Telemetry by Donatas Kučinskas
7:10 p.m. – Troubleshooting: from Art, to Science to Automation by Aviva Peisach
8:00 p.m. – Follow-up networking
About the talks
Distributed Tracing using Open Telemetry by Donatas Kučinskas
At Wix, we're running thousands of microservices in production, which makes such simple things as understanding a request flow or debugging a specific production request not that simple.
To overcome this, we've adapted various tools and practices. In this talk, we'll share our adventures with Distributed Tracing - what Distributed Tracing is, how we use it, and hopefully you'll learn something new about observability.
Troubleshooting: from Art, to Science to Automation by Aviva Peisach
In 2020 Wix faced a critical problem: production incident troubleshooting time was soaring, it took long minutes to hours to find the incident root cause and fix it.
With our scale and proliferation of domains, teams & products, it seemed impossible to find common denominators that will allow us to improve troubleshooting times across Wix. Afterall. troubleshooting was perceived as an art, only mastered by the most veteran and experienced developers in the team.
However we accepted the challenge and over the last 3 years we managed to improve MTTR of production incidents by 47%!
In this talk I'll describe the journey of taking (what seemed to be) an impossible challenge of wix’s inefficient troubleshooting, mapping it to find commonalities and patterns, onto creating tools, training and automations we use today. And how we transformed Wix engineering mindset and tools to effectively and quickly troubleshoot production!