GenAI over data lakes: build reliable data pipelines in an ever-changing world

Details
**Please note that you will be required to bring a government-issued ID that matches the name on your registration **
Event schedule:
- 6:00pm - Grab a seat
- 6:30 - GenAI over data lakes: build reliable data pipelines in an ever-changing world, talk by Jacopo Tagliabue
- 7:20 - Pizza & Beer and continue the discussion with other practitioners
- 8:00 - Event close
About the talk:
Reproducibility is a major obstacle in debugging data and AI projects, and in moving them from development to production. As GenAI favors data lakes and storage-compute decoupling, conventional approaches to replayability - from software engineering to warehouses - reveal critical limitations when confronted with modern data workloads.
Starting from a reproducibility checklist, we sketch a decision tree over open source and commercial tools built for data lakes. We highlight challenges and opportunities for vertically integrating point-wise solutions into a full lakehouse, and conclude with a small demo of our own lakehouse at Bauplan.
Address: 100 6th Ave, New York, NY 10013, United States
Google Maps Link: https://maps.app.goo.gl/8nHWdKy2dCyyUcKU6

GenAI over data lakes: build reliable data pipelines in an ever-changing world