Skip to content

Streaming Machine Learning on Flink

Photo of Niels Zeilemaker
Hosted By
Niels Z.
Streaming Machine Learning on Flink

Details

For our second Flink meetup, we're happy to announce that Márton Balassi is going to give a presentation on Streaming Machine Learning. Márton gave this presentation Flink Forward this year, and is a true expert on the topic.

Agenda:

• 18:00 Arrive, mingle, pizza, etc.

• 18:45 Streaming Machine Learning on Flink by Márton Balassi

As continuous big data processing is gaining popularity it naturally implies that there is a need to transition many of the distributed machine learning functionality to a streaming backend. The most common use case is to give streaming predictions based on the model learnt in batch, however in some cases it is beneficial to also update the model on the fly. It is not uncommon that streaming learners need different algorithms than their batch counterparts. The talk discusses the common use cases and the pitfalls of the streaming ML transition through the example of recommender systems. It also offer a dive into the implementation of a Scala library augmenting FlinkML with streaming predictors.

• 19:45 TBA

• 21:30: Everybody out

About Márton Balassi
Márton Balassi is a Solutions Architect at Cloudera and a PMC member at Apache Flink. His main focus is real-time distributed data processing frameworks. Márton has been a speaker at Hadoop Summit, ApacheCon and numerous Big Data related meetups recently.

Photo of Apache Flink Meetup Amsterdam group
Apache Flink Meetup Amsterdam
See more events
GoDataDriven
Wibautstraat 202 · Amsterdam