Building a Performant Data Lake House using Apache Iceberg & Trino Query Engine


Details
A data lakehouse is the next generation of the data warehouse and data lake, designed to meet today's complex and ever-changing analytics, machine learning, and data science requirements.
Apache Iceberg is a new table format that solves the challenges with traditional catalogs and is rapidly becoming an industry standard for managing data in data lakes.
Trino is an open-source distributed SQL query engine designed to query large data sets distributed over one or more heterogeneous data sources.
Event details
Join Trino and Iceberg experts at the Starburst (Trino's parent company) office in the Seaport for the next Meetup. There will be pizza, salad, beverages, and free Trino swag! Food will be served at 5:30pm, and at around 6:00pm Monica Miller, Developer Advocate at Starburst, and Brian Olsen, Head of Developer Relations at Tabular, will give a talk on building a data lakehouse using Trino and Iceberg.
More information on the talk
We are introducing the cutting-edge open source data stack for storing and querying data better than ever before: Trino and Iceberg.
- Iceberg is an open-source table format that automatically handles and hides partitioning, allowing you to take advantage of data lake speeds without the need for manual, tedious, and error-prone management.
- Trino is an open-source distributed SQL query engine, capable of running ad hoc queries at blazing-fast speeds and handling long-running ETL workloads with its fault-tolerant execution mode.
Both are great technologies independently, but it's together that they shine. Iceberg's storage format enables speedy reads and writes to pair with the speed of the Trino engine, and Trino empowers data scientists and analysts to query Iceberg tables with ANSI SQL, along with the interface and BI tool interoperability of a traditional data warehouse. Combine the benefits of a data lake and a data warehouse, and you get a data lakehouse: the new, better way to manage your data with Trino on Ice.
Address
Starburst Office
Floor 2
320 Summer Street
Boston, MA
02210
Speakers
Monica Miller
Starburst
Brian Olsen
Tabular
Alexander Jo
Software Engineer, Starburst
