What we’re about
An online meetup community for practitioners, developers, aspiring and professional data engineers and data scientists, who are interested in learning about data + AI. Join this group to connect with fellow enthusiasts and to learn more about open source projects including Apache Spark, Delta Lake, MLflow, Koalas, TensorFlow and PyTorch.
We host three types of live online meetups which we'll call out in the title of each event. Most meetups will be recorded and the videos will be posted here: https://dbricks.co/youtube-meetups
Interviews: Interview style with time for Q&A, no slides
Tech Talks: Presentation, slides, demo and time for Q&A
Workshops: Tutorials with time for Q&A
Upcoming events (1)See all
- Delta Lake Deep Dive: DeltaTorchLink visible for attendees
Delta Lake storage format gives deep learning practitioners unique data management capabilities for working with their datasets. The challenge is that, as of now, it’s not possible to use Delta Lake to train PyTorch models directly.
PyTorch community has recently introduced a Torchdata library for efficient data loading. This library supports many formats out of the box, but not Delta Lake. This Delta Lake Deep Dive with Michael Shtelma will demonstrate using the Delta Lake storage format for single-node and distributed PyTorch training using the torchdata framework and standalone delta-rs Delta Lake implementation. Let's dive in! 🌊