Past Meetup

"Stream Processing with Apache Flink" w/ Flink PMC Robert Metzger

59 people went

Details

It's a great honor to have Apache Flink PMC member and committer Robert Metzger here for the opening talk of the 2nd Flink Taiwan User Group meetup! Robert will provide an overview of the system internals, streaming-first philosophy, and programming APIs of Apache Flink.

For the second part of the night, we will also have an introductory talk on the Flink CEP (Complex Event Processing) Library. The talk will be given by Gordon Tai, co-organizer of the Apache Flink Taiwan User Group.

It's going to be a great night with lots of content, so definitely don't miss out on this one!

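The second Apache Flink Taiwan User Group Meetup is honored to have Apache Flink PMC member and committer Robert Metzger give the first talk! Robert's talk will dive into how Apache Flink's streaming-first core works and will cover the APIs for writing applications.

The second half will be presented by Gordon Tai, co-organizer of the Apache Flink Taiwan User Group, who will give a first introduction to the Flink CEP (Complex Event Processing) library.

Tonight's program will be packed with content, so if you are interested in Apache Flink, be sure not to miss it!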

Schedule:

7:00 - 7:30 PM
Networking & Snacks

7:30 - 8:30 PM
Stream Processing with Apache Flink - Robert Metzger
(Talk will be given in English)

8:30 - 9:30 PM
An Introduction to Apache Flink CEP Library - Gordon Tai
(Talk will be given in English)

Talk Descriptions:

Stream Processing with Apache Flink

Data streaming is gaining popularity, as more and more organizations are realizing that the nature of their data production is continuous and unbounded, and can be better served with a streaming architecture. Streaming architectures promise decreased latency from signal to decision, a radically simplified data infrastructure architecture, and the ability to cope with new data that is generated continuously. Apache Flink is a full-featured true stream processing framework with:

- Easy-to-use Java- and Scala-embedded APIs that simplify stream analytics, yet provide powerful tools to deal with time and uncertainty
- Throughput close to a million events per second per core
- Latencies as low as the millisecond range
- Full support for event time and out-of-order arrivals with flexible windows, watermarks, and triggers
- Exactly-once consistency guarantees, and the ability to realize distributed transactional data movement between systems (e.g., between Kafka and HDFS)
- Ease of configuration and separation between application logic and fault tolerance via a novel asynchronous checkpointing algorithm
- No single point of failure
- Integration with popular open source infrastructure (e.g., Hadoop, HBase, Kafka, Cascading, Elasticsearch, …)
- Batch processing as a special case of stream processing, including dedicated libraries for machine learning and graph processing, managed memory on- and off-heap, and query optimization
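
To make the event-time bullet above more concrete, here is a minimal sketch in plain Python of how a watermark can decide when an event-time window is complete even though events arrive out of order. It does not use Flink or reproduce its API; the function name `tumbling_window_counts` and the parameters `window_size` and `max_out_of_orderness` are made up for this illustration, and the watermark rule (largest timestamp seen minus a fixed bound) is just one simple assumption.

```python
from collections import defaultdict


def tumbling_window_counts(events, window_size, max_out_of_orderness):
    """Count events per event-time tumbling window (illustrative only).

    events: iterable of (event_time, payload) in ARRIVAL order, which may
            differ from event-time order.
    Returns (emitted, dropped): emitted is a list of (window_start, count)
    in the order the windows were closed; dropped counts events that
    arrived after their window had already been closed.
    """
    open_windows = defaultdict(int)   # window_start -> count so far
    emitted = []
    dropped = 0
    max_seen = None
    watermark = float("-inf")         # "no event older than this is expected"

    for event_time, _payload in events:
        window_start = (event_time // window_size) * window_size
        if window_start + window_size <= watermark:
            dropped += 1              # too late: its window was already emitted
            continue
        open_windows[window_start] += 1

        # Assumed watermark rule: trail the largest timestamp by a fixed bound.
        max_seen = event_time if max_seen is None else max(max_seen, event_time)
        watermark = max_seen - max_out_of_orderness

        # Emit every window whose end the watermark has passed.
        for start in sorted(open_windows):
            if start + window_size <= watermark:
                emitted.append((start, open_windows.pop(start)))

    # End of input: flush whatever is still open.
    for start in sorted(open_windows):
        emitted.append((start, open_windows.pop(start)))
    return emitted, dropped


if __name__ == "__main__":
    arrivals = [(1, "a"), (4, "b"), (3, "c"), (11, "d"), (9, "e"), (16, "f"), (2, "g")]
    print(tumbling_window_counts(arrivals, window_size=10, max_out_of_orderness=5))
```

In this sketch the event with timestamp 9 arrives after the one with timestamp 11 but is still counted in the first window, because the watermark has not yet passed that window's end; the event with timestamp 2 arrives after the window was closed and is counted as dropped.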

Flink is used in several companies, including Alibaba, Bouygues Telecom, King.com, and Capital One, and has a large and active developer community of well over 170 contributors. In this talk, we provide an overview of the system internals and its streaming-first philosophy, as well as the programming APIs.

Introduction to Apache Flink CEP Library

The Flink CEP (Complex Event Processing) Library, authored and shepherded by Flink PMC member and committer Till Rohrmann, allows detection of complex event patterns in an endless stream of data. In this talk, we will cover:

1. An overview of the applications and scenarios that the CEP library is useful for.
2. An introduction to the API that CEP offers for pattern matching on event streams.
3. The NFA (nondeterministic finite automaton) implementation that Flink CEP is based on to detect event patterns, for those who also like to know the underlying details ;)
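
As a rough illustration of the third point, the sketch below shows in plain Python how an NFA-style matcher can detect an ordered pattern in an event stream. It is not the Flink CEP implementation and does not use its API; the function `detect_sequences`, the `stages` structure, and the matching semantics (unrelated events may occur between stages, and every possible combination is reported) are assumptions chosen for this example.

```python
def detect_sequences(events, stages):
    """Find all occurrences of an ordered pattern in a stream (illustrative only).

    stages: list of (name, predicate); a match needs one event per stage,
            in order, with any number of other events in between.
    Each partial match is one active NFA run; its state is the number of
    stages it has matched so far.
    """
    partial = []   # active runs, each a list of the events taken so far
    matches = []
    for event in events:
        next_partial = []
        for run in partial:
            # Nondeterminism: every run both keeps waiting ("ignore" branch) ...
            next_partial.append(run)
            _name, accepts = stages[len(run)]
            if accepts(event):
                # ... and, if the event fits, also advances ("take" branch).
                extended = run + [event]
                if len(extended) == len(stages):
                    matches.append(extended)
                else:
                    next_partial.append(extended)
        # Any event that satisfies the first stage starts a new run.
        if stages[0][1](event):
            if len(stages) == 1:
                matches.append([event])
            else:
                next_partial.append([event])
        partial = next_partial
    return matches


if __name__ == "__main__":
    stream = [("a", 1), ("x", 2), ("b", 3), ("b", 4), ("c", 5)]
    pattern = [
        ("start", lambda e: e[0] == "a"),
        ("middle", lambda e: e[0] == "b"),
        ("end", lambda e: e[0] == "c"),
    ]
    for match in detect_sequences(stream, pattern):
        print(match)
```

Because runs are never discarded here, the set of partial matches grows with the stream; a practical matcher would bound it, for example with a time limit on how long a run may stay open.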

Speaker Bios:

Robert Metzger

Robert Metzger is a PMC member at the Apache Flink project and a co-founder and software engineer at Data Artisans. He is the author of many Flink components including the Kafka and YARN connectors. Robert studied Computer Science at TU Berlin and worked at IBM Germany and at the IBM Almaden Research Center in San Jose. He is a frequent speaker at conferences such as QCon London, Hadoop Summit in San Jose, and ApacheCon Big Data in Budapest, as well as at meetups in Europe and the US.

Tzu-Li (Gordon) Tai

Gordon Tai is a co-organizer of the Apache Flink Taiwan User Group, and loves working on Flink in his free time. He recently contributed the Amazon Kinesis Streams connector for Flink. Currently, Gordon works as a software engineer at VMFive, building data platforms to analyze and handle streaming data for the mobile advertising industry. He previously worked at Academia Sinica in Taiwan, where he researched distributed systems including Apache Spark, Apache Storm, and YARN.

Sponsors

A big thanks to VMFive (http://www.vmfive.com) for supporting the meetup space!

Many thanks to VMFive (http://www.vmfive.com/) for providing the venue for this Flink.tw meetup!