
Apache Kafka is an amazing streaming platform, and along with stream-processing libraries like Kafka Streams and tools like Apache Flink, it can meet many of our real-time data processing needs. But it has always been a challenge to query that streaming data. The most common pattern is to use Kafka Connect to write the data out to a database, which you can then query to your heart’s content. This works, but it adds unnecessary latency.

Apache Druid is a real-time database designed with Kafka in mind. Druid thinks about data the same way that Kafka does. With direct Kafka integration, Druid allows us to query real-time data, in, well, real-time. Even before the data is fully loaded into Druid, it is available to respond to queries. When I first learned how Druid does this I was blown away.
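To make that direct integration concrete, here is a minimal sketch of the kind of Kafka ingestion supervisor spec you submit to Druid. The topic name, datasource name, and broker address are placeholder assumptions, and a production spec would carry many more tuning options:

```python
import json

# A minimal sketch of a Druid Kafka ingestion supervisor spec.
# The topic ("events"), datasource name, and broker address are
# placeholders, not values from the talk.
supervisor_spec = {
    "type": "kafka",
    "spec": {
        "ioConfig": {
            "type": "kafka",
            "topic": "events",
            "consumerProperties": {"bootstrap.servers": "localhost:9092"},
            # Start from the beginning of the topic on first launch.
            "useEarliestOffset": True,
        },
        "dataSchema": {
            "dataSource": "events",
            "timestampSpec": {"column": "timestamp", "format": "iso"},
            "dimensionsSpec": {"useSchemaDiscovery": True},
            "granularitySpec": {"segmentGranularity": "hour"},
        },
    },
}

# The spec would be POSTed to the Overlord's supervisor endpoint,
# e.g. http://localhost:8081/druid/indexer/v1/supervisor
print(json.dumps(supervisor_spec, indent=2))
```

Once the supervisor is running, rows become queryable as they arrive from Kafka, before their segments are finalized and handed off to deep storage.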

In this session, we’ll get an overview of Apache Kafka and Apache Druid, and then we’ll focus on the way Druid ingests and queries events from Kafka with such amazing speed. We’ll also see how Druid can combine new incoming event data from Kafka with older stored data in the same query.
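That last point needs no special syntax: an ordinary Druid SQL query is answered from both real-time segments (still filling from Kafka) and historical segments, merged transparently. A hypothetical example against an assumed "events" datasource, as it would be sent to a Broker's SQL endpoint:

```python
import json

# Hypothetical Druid SQL query: counts events per minute over the
# last hour. Druid merges real-time and historical segments behind
# this one query; "events" is an assumed datasource name.
query = """
SELECT
  TIME_FLOOR(__time, 'PT1M') AS minute,
  COUNT(*) AS event_count
FROM events
WHERE __time >= CURRENT_TIMESTAMP - INTERVAL '1' HOUR
GROUP BY 1
ORDER BY 1 DESC
"""

# This payload would be POSTed to a Broker's SQL endpoint,
# e.g. http://localhost:8888/druid/v2/sql
payload = json.dumps({"query": query})
print(payload)
```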

And, since you’ll probably be as impressed with Druid as I was, I’ll leave you with some resources to continue your learning journey.

Note: This is a hybrid online/in-person event. Food and social begins at 6pm for those in the OCI training room. Meeting starts at 6:30pm for those online and in the training room.
