The Cost of Choosing to Not Version Your Data


Details
Presenters:
Gavin Medel-Gleason - CTO at TerminusDB
Paul Singman - Developer Advocate at Treeverse (lakeFS)
Agenda (times in EDT; CEST start time is 6:30pm; PDT start is 9:30am):
12:30-12:35pm: Introduction by Scott Hirleman
12:35-1:15pm: Presentations by Gavin and Paul
Post 1:15pm: Q&A
Format is Zoom broadcast*. We will upload to YouTube if you can't make it.
Per Zhamak, versioning is a necessary capability of data products in data mesh, similar to the DevOps world. However, versioning with data/data products isn't just the code to produce/support the data products but also covers versioning of the data itself.
Gavin (TerminusDB) and Paul (lakeFS) will discuss all things data versioning. Specifically, they'll cover:
- What it means to create a version of data
- How modern data tools give support data versioning natively via git-like operations
- The benefits of data versioning, including faster development and confident deployment of data
- Recommended versioning strategies and performance/cost considerations
Finally, they'll be available to answer any and all questions in the Q&A session that follows.
*There is no requirement to "register" for the webinar after signing up for the meetup, it is just a normal link. We do not have access to your email address via meetup nor do we want to :)

The Cost of Choosing to Not Version Your Data