Open source, open pipelines: Data ingestion with modern data stack


Details
Workshop
Data ingestion is the cornerstone of Data Engineering — it’s where every data journey begins. In this hands-on workshop, you’ll learn how to move data from anywhere to anywhere using the open-source modern data stack.
We’ll focus on practical skills, leveraging Python library dlt (data load tool) to ingest data from a REST API and load it into DuckDB, a fast and lightweight database. Whether you're just getting started with data pipelines or looking to modernize your current stack, this session will give you a solid foundation for building reliable, open-source ingestion workflows.
Come ready to write some code, get your hands dirty, and walk away with real-world ingestion superpowers.
Speaker
Violetta Mishechkina
Violetta Mishechkina is a Solutions Engineer at dltHub.
She has been working in the data field since 2018, with a background in machine learning.
Violetta started as a Data Scientist, training ML models and neural networks. A year ago, she joined dltHub’s Solutions Engineering team and discovered dlt, a Python library that automates 90% of tedious data engineering tasks. Now, she works closely with customers and partners to help them integrate and optimize dlt in production. Violetta also collaborates with her development team as the voice of the customer, ensuring the product meets real-world data engineering needs.
📆 Agenda
18:00 - Introduction
18:10 - Workshop
19:25 - Closing words and announcements
GitHub Repo
https://github.com/pyladiesams/data-ingestion-modern-stack-apr2025
Stream
YouTube stream
📧 Contact
Are you interested in speaking at one of our events? Have a good idea for a Meetup? Get in touch with us at amsterdam@pyladies.com
💬 Find us on the PyLadies Global workspace:
- https://slackin.pyladies.com enter your email address.
Accept the email invitation - Go to workspace https://pyladies.slack.com
- Join channel #city-amsterdam

Open source, open pipelines: Data ingestion with modern data stack