
Speedrunning the Lakehouse and AMA with Jacopo Tagliabue

Photo of Giulio Mazzanti
Hosted By
Giulio M. and 4 others

Details

✨ 🚀 Surprise Event with special guest: Jacopo Tagliabue 🐍 ✨

Join us for a surprise event with Jacopo Tagliabue from Bauplan: a technical talk on Speedrunning the Lakehouse, followed by an AMA (Ask-Me-Anything) session where you can ask about data, career advice, or anything else (did you know Jacopo worked with Olimpia Milano at the beginning of his career?).

Agenda

  • 👋 18:30 Doors Open: Food and Socializing
  • 🐍 19:00 Talk: Speedrunning the Lakehouse
  • 🎤 19:45 Ask-Me-Anything with Jacopo Tagliabue
  • 🍻 After the meetup we will organize a dinner close to the venue; anyone who wants to join is welcome!

📍 Venue: Team System, Via Giovanni Battista Pirelli 35, 5th floor (Gioia M2, Centrale M3, Isola/Garibaldi M5)
🤝 Event in collaboration with AgileLab, which is sponsoring the food.
📺 The event will be recorded and streamed on the Python Milano YouTube channel (links will appear here close to the event)

🏃 Speedrunning the Lakehouse
The lakehouse architecture has become a foundational design for modern data and AI workloads. But its flexibility comes at a cost: users and system developers must navigate multiple APIs, conflicting abstractions, and overlapping execution models. What if we started from scratch, with simplicity in mind? In this talk, we discuss the technical challenges of building a "Function-as-a-Service" (FaaS) lakehouse: if workloads were “just” chained functions, users and developers could easily reason about the full data lifecycle!

We argue that existing FaaS platforms were never designed for data-intensive workflows. To address this, we built a new system from the ground up using object storage and open formats. Repurposing lessons from OpenLambda, we deploy functions up to 15× faster than AWS Lambda. By extending Apache Iceberg's isolation with Git-like primitives, we support multi-language transactions with formal correctness proofs. Finally, we show how ephemeral functions, Arrow-native caching, and decoupled catalogs can simulate a full warehouse.

We conclude by emphasizing the role of user-facing APIs for adoption in real-world settings, and by sharing late-breaking results from our ongoing research.

✋ Please register with your real name and surname; otherwise we cannot guarantee you access to the venue.
👻 If a last-minute issue comes up, please remember to cancel your RSVP.
🙏 Thanks for your cooperation!

Python Milano
Team System
Via Giovanni Battista Pirelli, 35 Β· Milan
FREE
50 spots left