Join us for our next Allegro Tech Talks in Poznan. This time we will focus on Site Reliability Engineering (SRE).
Topic: "What Allegro outages and US forest wildfires have in common?"
In our everyday work we have to deal with errors in code, hardware failures, security problems, and sometimes ordinary human error. You cannot eliminate these problems entirely, even if you write the best code in the world, introduce the most restrictive safety procedures, and buy the highest-quality data center equipment from the best vendors. What matters is that in an emergency you are able to respond quickly. And what about those forest wildfires? You will find the answer in my presentation.
Julian Szulc - Site Reliability Engineer, Allegro
Topic: "Failure Friday in Allegro."
Everyone wants to know whether the thing they've built works according to the initial assumptions. Among these requirements are stability and resistance to failures or anomalies in the system. This is what our Wednesday Failure Fridays are all about: we check how a specific portion of the system behaves when put to the test.
Marek Jadżyn - Site Reliability Engineer, Allegro
Krzysztof Kliś - Site Reliability Engineer, Allegro
Topic: "How to recognize problem if you have 200 000 servers to monitor."
Imagine you have 200 000 servers doing hundreds and hundreds of different tasks for millions of end users in thousands of different locations. How do you manage it? How do you draw the line between excellent performance and total disaster? Oh, and all that while making sure end users don't even notice there is a problem. This is what we do at Akamai every second of every day.
Piotr Dobrzański - with Akamai for almost 5 years now as Senior System Operation Engineer. Responsible for keeping system operations and beyond running smoothly, and liaison for Akamai storage services. Past experience as a Linux/Unix admin and a one-man IT department.
Topic: "Engineering - it has nothing to do with magic!"
Although the twelve networking truths covered in RFC 1925 were supposed to be an April Fools' joke, they are actually a list of valid points every engineer should remember. I will try to show how a two-year-long project to create a central logging environment fits every one of these truths. The presentation also shows that engineering is about measuring and calculating, not magic.
Krzysztof Krzyżaniak - In twenty years in the Internet business I have covered more or less everything on the engineering side. For the last two years I have been Senior DevOps Tech Leader at Egnyte Inc., focusing mostly on creating software-defined data centers.
18:00 - Opening
18:10 - "What Allegro outages and US forest wildfires have in common?" Julian Szulc, Allegro
18:40 - "Engineering - it has nothing to do with magic!" Krzysztof Krzyżaniak, Egnyte
19:10 - "Failure Friday in Allegro." Krzysztof Kliś, Marek Jadżyn, Allegro
19:40 - "How to recognize problem if you have 200 000 servers to monitor." Piotr Dobrzański, Akamai
20:10 - Networking (Pizza)
The meeting will be held in Polish.
Please stay tuned for further updates.
Allegro Tech Talks is a series of meetups where we talk about the projects we run, the problems we face and the unique solutions we implement. We also invite external guests so that we can learn about their experiences and points of view.