Skip to content

Apache Iceberg Meetup - Lakehouse Days, Hyderabad

Photo of e6data
Hosted By
e6data
Apache Iceberg Meetup - Lakehouse Days, Hyderabad

Details

Lakehouse Days - Powered by AWS is designed specifically for data engineers, data architects, and senior software engineers who constantly seek to optimize their data architecture to make it more price-performant while delivering the best user experience.

Register here: https://lu.ma/ahuq2jqz?utm_source=meetup

In this edition, we will dive deep into the internal architecture of open table formats like Apache Iceberg, how Apache Kafka works, building a modern data platform that simultaneously queries streaming and analytical data on Iceberg, how Amazon S3 Tables delivers a fully managed Apache Iceberg experience to simplify large-scale analytics on Amazon S3, and how Arrow IPC enhances Apache Iceberg-based data lakes by accelerating streaming ingestion and query execution. We aim to raise awareness about these open-table formats and gain a deeper understanding.

Lakehouse Days - Powered by AWS is designed to enable fellow data geeks to meet, network, and have insightful discussions on the entropic world of data.

Speakers:

Diptiman Raichaudhari, Staff Developer Advocate, Confluent

Topic: Streaming data into a Lakehouse - Kafka greets Iceberg
Summary: This session will start from the ground up on what Iceberg is, how Kafka works, and the community efforts behind two of the most important frameworks, Apache Kafka and Apache Iceberg, coming closer.
It will guide you in building a modern data platform that simultaneously queries streaming and analytical data on Iceberg.
Speaker Bio: Diptiman is a Staff Developer Advocate at Confluent. He designed and implemented ‘Modern Data Platform’ for large-scale enterprise use cases. He works at the intersection of Data (Kafka, Flink, Spark, Kinesis, Redshift, Iceberg, Glue, Hive, Neo4j, Neptune) and AI (Torch, Sagemaker, Vertex AI, Kubeflow, LLMs) at the cloud scale (AWS and Google Cloud).

David John Chakram, Principal Architect, AWS
Topic: Amazon S3 Tables: Scaling Apache Iceberg for High-Performance Analytics
Summary: In this session, you’ll explore how Open Table Formats (OTFs) like Apache Iceberg revolutionize how organizations store and process tabular data at scale. We’ll explore Iceberg’s key features and advantages over traditional approaches and how Amazon S3 Tables, AWS’s latest innovation, delivers a fully managed Apache Iceberg experience to simplify large-scale analytics on Amazon S3.
Speaker Bio: David John Chakram is a Principal Solutions Architect at AWS. He specializes in building data platforms and architecting seamless data ecosystems. With a profound passion for databases, data analytics, and machine learning, he excels at transforming complex data challenges into innovative solutions and driving businesses forward with data-driven insights.

Karthic Rao, Principal Engineer, e6data
Topic: Fast Distributed Iceberg Writes and Queries with Apache Arrow IPC
Summary: Learn how Arrow IPC enhances Apache Iceberg-based data lakes by accelerating streaming ingestion and query execution. Unnderstand Arrow IPC’s zero-copy data sharing and high-speed transport via Arrow Flight, which streamlines data movement and aligns seamlessly with Iceberg’s columnar storage. Experience hands-on demonstrations on how Arrow IPC unifies fast writes and queries, delivering efficiency and scalability to Iceberg data platforms.
Speaker Bio: Karthic Rao is a seasoned engineer and open-source enthusiast with 10+ years of experience tackling early-stage, high-impact projects. His extensive open-source background includes being an early engineer at Minio (distributed object storage), an early Caddy web server project member, and a Product and DevRel lead at Dgraph (graph database).

Photo of Data Engineering Meetups group
Data Engineering Meetups
See more events
Amazon Development Centre (HYD11)
Financial District, Nanakramguda, Hyderabad, Telangana 500032 · Hyderabad