Kafka Streams Applications

Hosted By
shani e.

Details

18:00 - 18:30: Networking, mingling & refreshments.

18:30 - 19:00: Real-time fraud detection with Kafka Streams.
Ofir Sharony @ MyHeritage.

19:00 - 19:30: Querying Kafka with Presto.
Itamar Syn-Hershko @ BigData Boutique.

*** All talks are delivered in English and live-streamed via YouTube ***

First session description:
In this talk, we'll build a gatekeeper for your website. Our fraud detection system will target various types of malicious activity, such as account takeover, parameter tampering, forbidden access, and more. We'll try to identify potential attacks and react to them in near real time. Addressing this problem in a classical batch fashion results in a complex solution that is neither scalable nor real-time. We'll adapt our clumsy implementation to a modern stream-processing windowed aggregation, use Kafka Streams as our streaming framework, and end up with beautiful, clean, and maintainable code.

Bio:
Ofir is a back-end team lead at MyHeritage, with a passion for event-driven design and stream processing frameworks. Ofir has acquired most of his experience by planning scalable server-side solutions and developing data pipelines. Ofir has spoken about these ideas at local and global conferences and has written about them here: https://medium.com/@ofirsharonys.

*******************************************************************************

Second session description:
Presto is a state-of-the-art distributed SQL query engine for BigData, enabling efficient querying of cold data and various data sources. With an extended SQL language and features like geospatial queries, joins between different data sources (SQL to join data from HDFS, Elasticsearch, and Kafka, anyone?), and the ability to run on containers and cheap servers, Presto is slowly becoming the standard ad-hoc querying engine for BigData.

In this talk, we will present Presto and how it can be used with Kafka. We will discuss data architectures, Presto features, and why it is so good for your data, and finally see how it can be leveraged to query data from Kafka, as well as to execute a single SQL statement that joins data from Kafka with data from SQL, Cassandra, Elastic, and more.

Bio:
I'm a search technologies, BigData, and distributed systems expert. Over the years I have built and maintained several large mission-critical systems on both Windows and Linux, and gained a lot of experience that I now use to perfect systems built to deal with scale. Today I'm a frequent speaker at international conferences and provide on-site training and consultancy services around the world via BigData Boutique.

#ApacheKafkaIL