Tue, May 19 · 6:30 PM CEST
Hello PyData People!
We are excited to announce our next event of 2026! This time, we will be hosted at Generali Italia’s Torre Generali in Milan for an evening dedicated to open data, data platform orchestration, and scalable data engineering.
📅 When: Tuesday, May 19th, 2026 – 18:30–21:00
📍 Where: Torre Generali Italia, Milan
⚠️ Important: Spots are limited. Please keep your RSVP updated to allow others to participate if you can no longer attend.
***
🕒 Agenda
18:30 – Doors open & check-in
19:00 – Talk 1: Democratizing Data: A deep dive into Eurostat Open Database – Simona Mazzarino
19:45 – Talk 2: Scaling Data Mesh Orchestration with Dagster: Platform-Driven DAGs without Platform Friction – Marco Santoni and Andrea Romeo
20:30 – Networking & Social Dinner
***
🎤 The Talks
1️⃣ Democratizing Data: A deep dive into Eurostat Open Database
Speaker: Simona Mazzarino (Data Scientist @ Clearbox AI)
In this talk, we explore how to turn open data into actionable insights. You will learn how to navigate Eurostat, the EU’s vast statistical database, and use Python tools to fetch and process data for research, AI, and real-world analysis. We will look at how to discover relevant datasets, work with Eurostat’s interface and APIs, and integrate public statistics into your workflows for visualizations, dashboards, and models. A guiding example will be the analysis of migration flows across European countries, from data discovery to a usable dataset for analytics or machine learning.
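As a rough taste of the "fetch and process" step, here is a minimal sketch, not the speaker's material. It assumes the Eurostat dissemination API lives at the base URL shown and returns JSON-stat 2.0 with `id`, `dimension`, and a sparse `value` mapping. The dataset code `some_dataset_code`, the helper names, and the numbers in the sample payload are all made-up placeholders, and the sample stands in for a real response so the snippet runs offline.

```python
from itertools import product
from urllib.parse import urlencode

# Assumption: base URL of the Eurostat dissemination API (verify before use).
EUROSTAT_BASE = "https://ec.europa.eu/eurostat/api/dissemination/statistics/1.0/data"


def build_url(dataset, **filters):
    """Build a request URL; list-valued filters repeat the query parameter."""
    query = urlencode({"format": "JSON", **filters}, doseq=True)
    return f"{EUROSTAT_BASE}/{dataset}?{query}"


def jsonstat_to_rows(payload):
    """Flatten a JSON-stat 2.0 style payload into a list of dict rows.

    Assumes each dimension's category index is a {code: position} mapping
    and that values are stored in row-major order (last dimension fastest).
    """
    dims = payload["id"]
    codes = []
    for dim in dims:
        index = payload["dimension"][dim]["category"]["index"]
        codes.append(sorted(index, key=index.get))  # codes ordered by position
    values = payload["value"]
    rows = []
    for pos, combo in enumerate(product(*codes)):
        # Sparse responses use string keys and simply omit missing cells.
        value = values.get(str(pos)) if isinstance(values, dict) else values[pos]
        if value is None:
            continue
        rows.append({**dict(zip(dims, combo)), "value": value})
    return rows


# Hypothetical payload with invented numbers, standing in for a real response.
sample = {
    "id": ["geo", "time"],
    "size": [2, 2],
    "dimension": {
        "geo": {"category": {"index": {"IT": 0, "DE": 1}}},
        "time": {"category": {"index": {"2020": 0, "2021": 1}}},
    },
    "value": {"0": 100, "1": 110, "3": 210},  # cell 2 (DE, 2020) is missing
}

url = build_url("some_dataset_code", geo=["IT", "DE"])
rows = jsonstat_to_rows(sample)
print(url)
print(rows)
```

The resulting list of rows can be handed to `pandas.DataFrame(rows)` for plotting or modelling.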
About the Speaker:
Simona Mazzarino is a Data Scientist at Clearbox AI. With a background in linguistics, semiotics, and artificial intelligence, she specializes in language technologies. She is an active volunteer in the Python Torino community, where she helps connect people interested in Python applications, and she also volunteers for PyCon Italia.
***
2️⃣ Scaling Data Mesh Orchestration with Dagster: Platform-Driven DAGs without Platform Friction
Speakers: Marco Santoni; Andrea Romeo (Platform Engineer @ TeamSystem)
Data Mesh promises scalability and autonomy, but orchestration often becomes the hidden bottleneck: as data products grow, cross-domain dependencies explode, DAGs become fragile, and upstream teams are forced to react to every new consumer.
In this talk, we present a platform-designed orchestration solution built on top of Dagster that enables company-wide orchestration without coupling data products together. Instead of requiring data engineers to manually define DAGs or modify upstream pipelines, each data product simply declares its dependencies on upstream output ports. From these declarations, the platform automatically builds and maintains the global DAG behind the scenes, ensuring that upstream data products remain unaffected by new consumers, orchestration scales as the number of data products grows, and platform standards are enforced without slowing down teams.
We’ll walk through:
- the orchestration challenges we faced when adopting data mesh at scale
- how Dagster was extended and shaped into a platform abstraction, not exposed directly to product teams
- the Python-based design patterns used to dynamically generate and evolve DAGs
- the resulting developer experience for data engineers building data products
This session is a practical, production-tested story of how Python and Dagster can be used to enable scalable orchestration in a real data mesh implementation — without turning the platform team into a bottleneck.
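To make the idea concrete, here is a minimal sketch of declaration-driven DAG building, not the speakers' implementation. It uses only the Python standard library rather than Dagster, and every name in it (`DataProduct`, `output_ports`, `depends_on`, `build_global_dag`, and the example products) is hypothetical. Each product lists only the upstream output ports it consumes, and a platform-side function derives the global graph.

```python
from dataclasses import dataclass
from graphlib import TopologicalSorter  # Python 3.9+


@dataclass(frozen=True)
class DataProduct:
    """Hypothetical declaration owned by a product team."""
    name: str
    output_ports: tuple[str, ...] = ()
    depends_on: tuple[str, ...] = ()  # references of the form "product.port"


def build_global_dag(products):
    """Platform side: derive {product: set of upstream products}."""
    port_owner = {
        f"{p.name}.{port}": p.name for p in products for port in p.output_ports
    }
    graph = {}
    for p in products:
        upstream = set()
        for ref in p.depends_on:
            if ref not in port_owner:
                raise ValueError(f"{p.name} depends on unknown port {ref!r}")
            upstream.add(port_owner[ref])
        graph[p.name] = upstream
    return graph


def execution_order(graph):
    """Return one valid run order; raises graphlib.CycleError on cycles."""
    return list(TopologicalSorter(graph).static_order())


# Upstream products declare outputs and know nothing about their consumers.
products = [
    DataProduct("orders", output_ports=("daily",)),
    DataProduct("customers", output_ports=("profiles",)),
    DataProduct("revenue", output_ports=("report",),
                depends_on=("orders.daily", "customers.profiles")),
    # Adding this new consumer requires no change to "customers".
    DataProduct("churn", depends_on=("customers.profiles",)),
]

graph = build_global_dag(products)
order = execution_order(graph)
print(graph)
print(order)
```

In this sketch, adding a consumer means adding one declaration; the upstream entries stay untouched, which mirrors the decoupling the abstract describes. How such declarations map onto Dagster is left to the talk.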
About the Speakers:
Andrea Romeo is a Platform Engineer at TeamSystem, where he focuses on building and evolving the technical foundation behind data infrastructure. He works closely with development and data teams to design scalable, resilient platforms that support data-driven products and services, with a strong focus on backend, cloud technologies, and developer experience.
***
See you there!
The PyData Milano Team