GenAI over data lakes: build reliable data pipelines in an ever-changing world

Hosted By
Shir R. and 4 others

Details

**Please note that you will be required to bring a government-issued ID that matches the name on your registration.**

Event schedule:

  • 6:00pm - Grab a seat
  • 6:30pm - GenAI over data lakes: build reliable data pipelines in an ever-changing world, talk by Jacopo Tagliabue
  • 7:20pm - Pizza and beer, and further discussion with other practitioners
  • 8:00pm - Event close

About the talk:
Reproducibility is a major obstacle in debugging data and AI projects, and in moving them from development to production. As GenAI favors data lakes and storage-compute decoupling, conventional approaches to replayability - from software engineering to warehouses - reveal critical limitations when confronted with modern data workloads.

Starting from a reproducibility checklist, we sketch a decision tree over open source and commercial tools built for data lakes. We highlight challenges and opportunities for vertically integrating point-wise solutions into a full lakehouse, and conclude with a small demo of our own lakehouse at Bauplan.

Address: 100 6th Ave, New York, NY 10013, United States
Google Maps Link: https://maps.app.goo.gl/8nHWdKy2dCyyUcKU6

NYC Artificial Intelligence & Machine Learning