What we’re about
Welcome to Streaming Stories hosted by RisingWave Labs!
Upcoming events (1)
- Is Kafka a New Data Lake
Data lakes are critical for efficiently managing vast amounts of raw, unstructured, and semi-structured data. At the same time, Apache Kafka, a widely popular event streaming platform, has long been perceived as a “repository for recent data” in the modern data stack. Many data engineers use Kafka to hold recently ingested data, typically for seven days to a month, before transferring it into data lakes. Many consider event streaming platforms a home for transient data and data lakes a home for historical data. More recently, however, some have argued that Kafka is evolving into a new form of data lake.
During this panel discussion, we'll look into whether that claim holds up. Does Kafka have all the necessary properties to become a data lake? How does Kafka compare with existing data lake management frameworks such as Apache Iceberg, Apache Hudi, and Delta Lake? What key features would Kafka need to add to become a data lake? And finally, can Kafka take over that role?
Join us and learn what our panelists have to say about these topics, as well as the overall evolution of data lakes and Kafka.