Apache Beam on Apache Flink


Details
Apache Beam is an open source, unified programming model for defining and executing data-parallel processing pipelines. These pipelines can be used to address many practical problems, including ETL, IoT, and both batch and streaming processing. A pipeline, once defined can be executed on many distributed processing back ends, including Apache Flink, Apache Spark, and Google Cloud Dataflow.
Of the available runners, Apache Flink is uniquely positioned as an attractive alternative outside of Google Cloude Dataflow. As Tyler Akidau, Apache Beam PMC, puts it:
For Apache Beam to achieve its goal of pipeline portability, we needed to have at least one runner which was sophisticated enough to be a compelling alternative to Cloud Dataflow when running on premise or on non-Google clouds. As you can see from these tables, Flink is the runner which currently fulfills those requirements. With Flink, Beam becomes a truly compelling platform for the industry.
This talk will help you become acquainted with Apache Beam, and see it in in action. Problem space, source code, live demonstration and pizza to be provided!

Sponsors
Apache Beam on Apache Flink