Using Apache Iceberg for multi-function analytics in the cloud


Details
We know what you're thinking. Icebergs...in the cloud? How's that work? But it isn't a (mixed) metaphor. Iceberg is a high-performance format for huge analytic tables. Iceberg brings the reliability and simplicity of SQL tables to big data with a growing set of compute engine support to safely work with the same tables, at the same time, for advanced analytics use cases. Unlike proprietary table formats that some vendors make you use, Apache Iceberg is an open standard that can be used by any processing engine.
What the session will cover
During this meetup, we’ll assume you’ve never heard of Apache Iceberg and explain the basics: what problems the Apache Iceberg project is addressing, how iceberg works, what features iceberg tables offer and how you can put Iceberg to work in your own data projects that utilize multi-function analytics such as Hive, Spark, or Impala.
We’ll show you during a live demonstration how to create Iceberg tables for multi-function analytics and apply advanced techniques such as Time Travel and Partition Evolution, all in the cloud as available in Cloudera Data Platform.
Agenda
6:00 - 6:45 PM EST: Presentation - Bill Zhang, Director of Product Management at Cloudera
6:45 - 7:15 PM EST: Demonstration - Luiz Carrossoni Neto, Cloudera Principal Solutions Engineer
7:15 - 7:30 PM EST: Q&A and possibly a raffle for participants with Navita Sood
For a preview of what we'll be covering, we've offer the following related resources for your consideration:
Blog Post:
Introducing Apache Iceberg in Cloudera Data Platform
Community Article:
Using Iceberg Table Format in CDP Public Cloud to Ingest, Process and Analyze Stock Intraday Data
Cloudera Users Page:
https://www.cloudera.com/users.html
Presenters
Join Bill Zhang, Luiz Carrossoni Neto and Navita Sood, all of Cloudera, and get acquainted with Apache Iceberg. We are looking forward to seeing you there!
This is still a tricky time for public gatherings, but Future of Data is committed to providing great tech content & facilitating discussions in the "Big Data" space. Our group in Harrisburg, Pennsylvania is holding this event; in order to do our part to fight the spread of COVID-19's Omicron variant, this will be an exclusively online event originating in Eastern Standard Time (however the event time displayed on this page will reflect the equivalent local time). We thought it might be of interest to our wider membership (you are welcome to register for it here).

Using Apache Iceberg for multi-function analytics in the cloud