Hands-on workshop: Version control your data lake with lakeFS


Details
It's time to crack open a laptop and get to work with our lakeFS Hands-on Workshop!
🚨Prerequisite: You must bring a laptop with docker installed. 🚨
--------
🤝Organizer : lakeFS
📍Location: TBD
🍕Catering: Pizza & Drinks
👩🏻💻👨🏾💻 Who is it for:
This workshop is open to all folks in the data community who want to learn and grow their Data Engineering skills. Students, Professionals, and Careers changers are all welcome to join and learn about the best strategies & workflows in the data community.
--------
📚 What will we be covering:
The first problem faced with big data was the feasibility of processing data at such a high scale. In solving the scale problem, people developed technologies we know today like Kafka, Spark, Presto, Snowflake, and many others powering big data operations today.Now the problem people face is one of manageability. People no longer ask if they can handle a dataset but rather: How can I move faster when developing data-intensive applications? How do I utilize all of my data and ensure it is high-quality?Learn how to simplify the management of a data lake by enabling git-like operations over files in object storage
--------
📋Agenda:
- 5pm - 5:30pm | Check in/Networking
- 5:30pm - 5:45pm | Intro to lakeFS
- 5:45pm - 6:30pm | How to use lakeFS on top of your object store and build a data repository.
- 6:30pm - 7:15pm | How to use lakeFS to create test data environment with zero-copy cloning of prod data.
- 7:15pm - 7:30pm | Closing Remarks & Final Q&A
- 7:30pm - 8pm | Networking
--------
➡️ Join the lakeFS Slack community: https://lakefs.io/slack
- lakeFS is an open source data version control for data lakes.
- It enables zero copy Dev / Test isolated environments, continuous quality validation, atomic rollback on bad data, reproducibility, and more.
- Learn more: https://lakefs.io/
COVID-19 safety measures

Canceled
Hands-on workshop: Version control your data lake with lakeFS