PyData Kaunas: Data Lineage and Large Dataset Annotation


Details
1. Tomas Peluritis (Head of Data @Mediatech) - Data Lineage: Where 'It Depends' Finally Gets an Answer
The talk will be a detective story of tracking data through the enterprise maze, solving mysteries of broken pipelines, and transforming spaghetti SQL into a well-documented feast. This session investigates modern tools and techniques that help data teams sleep better at night, knowing exactly where their data came from - and where it went.
With over a decade of experience in data engineering, Tomas (also known as Duomenų Dėdė/Uncle Data) has worked across companies of various sizes, from startups to enterprises. He actively contributes to the data community through technical writing and speaking in various events, sharing practical insights from real-world implementations.
2. Audrius Kučinskas (carVertical CTO) - Scaling Data Annotation
In this session, we'll explore how to efficiently annotate large datasets by leveraging the expertise of subject matter experts. We'll discuss scalable approaches to managing data annotation workflows and tools for automating tasks while maintaining high-quality results.
Audrius Kučinskas co-founded carVertical seven years ago and has been its CTO since then. He has a broad and varied experience in tech and business development and currently leads 70 professionals.
carVertical was ranked among the Fastest Growing European Companies in 2024 by the Financial Times.
---
This meetup is co-organized with carVertical. Thanks for your collaboration!
---
Just a reminder, got a topic? We would like to hear your presentation!
Even lightning talks (<5 min) are welcome!
---
Note: The event will take place at BLC (D building, 5th floor, Edison hall).
We'll have snacks and drinks ready for you!

Sponsors
PyData Kaunas: Data Lineage and Large Dataset Annotation