Low-latency ingestion and analytics with Apache Kafka and Apache Apex (Hadoop)


Details
Do come to pick up Apache Apex T-Shirts.
This talk will cover a fully fault tolerant, scalable, and operational ingestion from Kafka using Apache Apex application, running natively in Hadoop. The talks will deep dive into technical details of the connectors in Apache Malhar. Details of production use cases will also be discussed.
Agenda
6:00pm - Drinks, Food, and Socialize
6:15pm - Talk #1 will present how Apache Apex consumes from Kafka topics for real-time time processing and analytics. We will cover the features of the Apex Kafka Connector, which is one of the most popular operators in the Apex Malhar operator library, and powers several production use cases. We will explain the advanced features this operator provides for high throughput, low latency ingest and how it enables fault tolerant topologies with exactly once processing semantics.
7:00 - Q&A
7:15pm - Talk #2 covers recently added support in Apache Apex Malhar for the new Kafka 0.9 consumer API. We will cover how the new API has simplified certain aspects of the connector, performance and scalability considerations when consuming data from Kafka with Apache Apex, interoperability with MapR streams and plans for future enhancements
7:45pm - Q&A
8:00pm - Demo
8:15 - Q&A, Food, Drink, Socialize
Speakers:
-
Siyuan Hua, committer of Apache Apex
-
Thomas Weise, committer and PPMC member of Apache Apex. A Hadoop veteran

Low-latency ingestion and analytics with Apache Kafka and Apache Apex (Hadoop)