Big Data & Analytics meetup 2020/02 - Distributed stream processing

Budapest Big Data & Analytics Meetup

Cloudera Budapest Office

Széchenyi István tér 7 · Budapest

How to find us

Approach the elevator banks opposite the main reception in the Roosevelt office building and Cloudera staff members will guide you to the office on the 7th floor.

Details

This meetup will focus on streaming data technologies such as Structured Streaming in Apache Spark and the Apache Flink Streaming engine.

Planned speakers and talks:

1) Streaming Technologies Intro
Marton Balassi, Cloudera

This talk will provide an overview of the current streaming technology landscape.

Marton is an Engineering Manager at Cloudera. He is an Apache Flink PMC member and one of the first contributors to the streaming API. Over the last four years, as a Senior Solutions Architect at Cloudera, he has driven big data adoption at around 50 customers. He manages the recently formed Streaming Analytics team and focuses on adding Flink to the Cloudera platform.

2) Introduction to Spark Streaming
Gabor Somogyi, Cloudera

Spark's latest streaming engine is Structured Streaming, a scalable and fault-tolerant stream processing engine built on the Spark SQL engine. You can express a streaming computation the same way you would express a batch computation on static data. The Spark SQL engine takes care of running it incrementally and continuously, updating the final result as streaming data continues to arrive.

Gabor is a Software Engineer at Cloudera and an Apache Spark contributor who made major improvements to Spark's Kafka connector in Spark 3.0.

3) What's new with Flink Streaming
Marton Balassi, Cloudera

Apache Flink Streaming is a low-latency, distributed data processing engine that focuses on stateful jobs and sophisticated windowing. Enterprises across industries, including Alibaba, Netflix and ING Bank, rely on Flink to deliver latency-critical business value in a range of use cases. In its latest release, the Flink community has focused on refining the SQL API to democratize stream processing and open it up to the BI analyst community.

Schedule:
18:00 Doors open
18:30 Talks begin
20:00 Follow-up discussion

This event is jointly organized with the Future of Data: Budapest meetup group (http://bit.ly/3beBuoR)

Venue and catering will be provided by Cloudera Hungary. This is an English-speaking event.