Skip to content

Apache Iceberg Meetup - Lakehouse Days, Bengaluru

Photo of e6data
Hosted By
e6data
Apache Iceberg Meetup - Lakehouse Days, Bengaluru

Details

We’re bringing Lakehouse Days back to Bengaluru, in collaboration with RisingWave! Join e6data for an exclusive in-person meetup designed for data engineers, architects, and senior software engineers. We will cover:

🔹 Apache Iceberg™ internals and Optimizations
🔹 Merge-on-read query, serverless compaction and Iceberg table sharing with RisingWave
🔹 Optimizing query performance
🔹 Handling data transfers with Apache Arrow Flight
🔹 Iceberg's integration with GCP
🔹 Real-world case studies from industry pros

Register Now: https://lu.ma/pd0r4bmr?utm_source=meetup

Speakers:

Rayees Pasha, CPO, RisingWave Labs
Topic: Streaming-first Approach to Iceberg with RisingWave
Summary: The session will provide an overview of the technical challenges of building a new Iceberg Table engine that is purpose-built for streaming workloads. The talk will highlight how RisingWave has built end-to-end key capabilities for Iceberg table management, including Iceberg’s merge-on-read query, Serverless Compaction, and Iceberg table sharing to allow direct queries from other engines. A key feature in this project is the native Iceberg compaction service written in Rust using Apache DataFusion and Apache Iceberg-Rust as foundational components.

Ankur Ranjan, Sr Software Engineer, e6data
Topic: Apache Arrow Flight: Reshaping How We Handle Data Transfers
Summary: In this talk, we will explore how Apache Arrow Flight overcomes the challenges of traditional protocols like ODBC and JDBC by providing a columnar-native transport that maintains data in its original format throughout the transfer process. Arrow Flight promises to enhance analytical workloads and align perfectly with modern data architectures by eliminating unnecessary conversions and streamlining data transfers. Join us to discover how this innovative approach can substantially improve data processing efficiency.

Sai Vineel Thamishetty, Sr Data Engineer, Walmart
Topic: Apache Iceberg with Google Cloud Platform (GCP)
Summary: This talk will explore the exciting developments with Apache Iceberg and its integration with Google Cloud Platform. Iceberg is now allowing users to store tables on Google Cloud Storage, which means we can use GCP’s scalable infrastructure alongside Iceberg’s performance enhancements. Popular data processing engines like Apache Spark and Trino have improved their support for Iceberg, making it easier for us to work with these tables directly in the cloud. There’s also a lot of buzz around improving interoperability with BigQuery, which could facilitate smoother data transfers and queries.

Mark your calendars for March 22, 2025! We’ll kick things off bright and early at 09:30 AM in Accel Launchpad, Koramangala!

Photo of Data Engineering Meetups group
Data Engineering Meetups
See more events
Accel LaunchPad
881, 6th Cross 6th Block, Club Road, Koramanagala, Koramangala, Bengaluru, Karnataka 560095 · Bengaluru