Past Meetup

#35. WHUG - "Assisting millions of active users in real-time with Apache Flink"

This Meetup is past

81 people went

Location image of event venue


We are happy to invite you on the 35th meetup of WHUG. We will host Grzegorz Kołakowski and Adam Kawa from GetInData ( Please notice that this time, we are going to meet in new location, in a pub "Lokal na Mokotowie" ( at Sandomierska 13, Warsaw (entrance from Rejtana)!
Please check details about meetup below.

"Assisting millions of your active users in real-time at telco, banking or e-commerce with Apache Flink"

"Nowadays many companies become data rich and intensive. They have millions of users generating billions of interactions and events per day. These massive streams of complex events can be processed and reacted upon to e.g. offer new products, next best actions, communicate to users or detect frauds, and quicker we can do it, the higher value we can generate.

Our presentation will be based on our recent experience in building a real-time data analytics platform for telco events. This platform has been jointly built by GetInData and the leading telco in Kazakhstan in just a few months and it currently runs in production at the scale of 10M subscribers and 160K events per seconds on average (300K eps in a peak) on a still small cluster. It's used as a backbone for personalized marketing campaigns, detecting frauds, cross-sell & up-sell by following the behavior of millions of users in real-time and reacting to it instantly.

We will share how we build such platform using current best of breed open-source projects like Flink, Kafka, and Nifi. We will also describe challenges that we faced during development and try to provide some tips what one should pay attention to when developing similar solutions, not only for telco, but also for banks, e-commerce, IoT and other industries."

Grzegorz Kołakowski (GetInData) - A software engineer with five years of experience. Recently a great enthusiast of stream processing and related open source tools, in particular Apache Flink and Apache Kafka. Currently, he is a data engineer at GetInData helping companies with building scalable, distributed systems for storing and processing big data volumes.

Adam Kawa (GetInData) - Adam became a fan of Big Data after implementing his first Hadoop job in 2010. Since then he has been working with Big Data at Spotify (where he had proudly operated one of the largest and fastest-growing Hadoop clusters in Europe), Truecaller, the University of Warsaw and Cloudera Training Partner. Over three years ago, he co-founded GetInData - a company that helps its customers to become data-driven and builds innovative Big Data solutions. Adam is also co-organizer of Warsaw Hadoop User Group and a frequent speaker at major Big Data conferences and meetups.

GetInData is HIRING!!!
We are looking for Big Data specialists to join our team:
- DATA ENGINEER, who would like to work with Java, Hadoop, Hive and learn/use Spark, NiFi, AWS and similar technologies to implement data-oriented pipelines and data-driven applications. Mid or Senior level

- BIG DATA ADMINISTRATOR, who would like to install, configure and secure Big Data platforms based on Hadoop-stack & Kerberos, and learn/use cloud-ready technologies such as Docker, Kubernetes for building innovative data infrastructures. Mid or Senior level.

If you are instructed in joining us, feel free to contact us at the meeting or send an e-mail to [masked].

Drinks & snacks
Free beers/drinks and snacks sponsored by GetInData will be available for the participants during the meetup.

See you!