Apache Iceberg Bay Area Community Meetup


Details
Organizer: ChanChan Mao, Aihua Xu, Tej Luthra
Date: November 4th, 2024
Time: 5-8PM PST
As space is limited, to attend the event you must fill out the form below to request a spot at the event:
## Agenda
5:00p - 6:00p: Doors Open & Networking 💃
6:00p - 7:25p: Welcome Remarks & Presentations!
7:25p - 8:30p: More Networking 🕺
## Presentations
🌟 Accelerate Your Iceberg Workloads on S3
This talk discusses the recent improvements that Amazon S3 team has been doing in Iceberg FileIO and LocationProvider to improve Iceberg user experience on S3. This includes better retry and fault tolerant executions (#10433 & #11052), better hashing scheme to reduce throttling (#11112), and integration with S3 Data Acceleration Toolkit and AWS CRT client to improve read performance.
**Jack Ye** is a Sr. Software Engineer at AWS Open Data Analytics. His team focuses on the integration of open source storage layer solutions including Iceberg, Hudi, Delta, Parquet, Avro, etc. with AWS analytics products. Jack is also a PMC member of the Iceberg project.
**Roni Burd** is Dir of Product Engineering at AWS, and builds platform and developer tools. Roni brings 15+ years of experience working in the query engines, storage engines, and compute platform for database systems and ML processing.
🌟 How We Implemented the Iceberg Connector in Rust!
In this talk, we will discuss how we implemented the Iceberg connector in Rust, replacing the original Java-wrapped version to address performance bottlenecks in serialization and memory usage. By following the Apache Iceberg specification, we built a native Rust connector that supports Iceberg’s advanced features, such as multi-catalog compatibility and streaming updates. We’ve contributed this new version to the apache/iceberg-rust repository, and will share insights into the architectural improvements and best practices for leveraging Iceberg in streaming environments.
**Yingjun Wu** is the founder of RisingWave Labs), a database company developing RisingWave, a distributed SQL database for stream processing. Before running the company, Yingjun was a software engineer at the Redshift team, Amazon Web Services, and a researcher at the Database group, IBM Almaden Research Center. He has been working in the field of stream processing and database systems for over a decade.
🌟 Iceberg at Netflix
Netflix's Iceberg past, present, and future (call out to community for where they see the technology challenges). Netflix will briefly cover our journey from Hive to Iceberg, current systems with catalog, compaction, and replication, and the improvements we're making.
**Snehal Chennuru** is an engineering manager for the Big Data Warehouse team at Netflix, with over a decade of experience building distributed systems at Netflix, Skyhigh Networks, and Clearwell Systems.
**Bryan Keller** is a software engineer on the Big Data Warehouse team at Netflix, with over a decade of experience building big data systems. He is also an early Iceberg advocate and Iceberg committer.
**Tim Jiang** is a software engineer on the Big Data Warehouse team at Netflix. Over the past few years, he has focused on strengthening data security for Iceberg and query engines.

Apache Iceberg Bay Area Community Meetup