Skip to content

PyData February: SQL w/ DuckDB & enterprise data orchestration

Photo of Cor Zuurmond
Hosted By
Cor Z.
PyData February: SQL w/ DuckDB & enterprise data orchestration

Details

Another month, another meetup! This time at the lovely central OBA (Openbare Bibliotheek Amsterdam), hosted by wonderful folks over at DuckDB Labs.

Join us at the OBA (Oosterdokkade) for two fascinating talks. Pedro Holanda will present DuckDB, a novel data management system that executes analytical SQL queries without requiring a server.
Bolke de Bruin & Jeff Fletcher will dive into how to make data orchestration sexy with Apache Airflow (and free tools from Astronomer)

Directions with the OBA:
We start with the buffet on the 7th floor. After that, the Meetup is held at a room on the 6th floor.

Schedule:
17.30 - 🍲 Welcome with a buffet!
18:30 - 🎤 DuckDB: Bringing analytical SQL directly to your Python shell - Pedro Holanda (DuckDB)
19:15 - ⏸️ Break
19:30 - 🎤 Sexy Enterprise Data Orchestration - Bolke de Bruin & Jeff Fletcher (Astronomer)
20:15 - 🥤 Drinks

🎤 DuckDB: Bringing analytical SQL directly to your Python shell
Pedro Holanda | DuckDB Labs

In this talk, we will present DuckDB. DuckDB is a novel data management system that executes analytical SQL queries without requiring a server. DuckDB has a unique, in-depth integration with the existing PyData ecosystem. This integration allows DuckDB to query and output data from and to other Python libraries without copying it. This makes DuckDB an essential tool for the data scientist. In a live demo, we will showcase how DuckDB performs and integrates with the most used Python data-wrangling tool, Pandas.

The talk is catered primarily toward data scientists and data engineers. The talk aims to familiarize users with the design differences between Pandas and DuckDB and how to combine them to solve their data science needs. We will give an overview of the five main characteristics of DuckDB and show a live demo of DuckDB and Pandas in a typical data science scenario, focusing on comparing their performance and usability while showcasing their cooperation. The demo is most interesting for an audience familiar with Python, the Pandas API, and SQL.

🧑 Speaker: Pedro Holanda
Pedro Holanda is a computer scientist with a background in database architectures. He completed his Ph.D. at CWI in Amsterdam, where he specialized in indexing for interactive data analysis. He is a prominent contributor to the open-source database management system, DuckDB. He is the COO of DuckDB Labs, a company that provides services and support for DuckDB.

🎤 Sexy Enterprise Data Orchestration
Bolke de Bruin & Jeff Fletcher | Astronomer

Harvard Business Review has yet to confirm it, but we think Enterprise Data Orchestration is even sexier than Data Science. In this talk, we will dive into how to make data orchestration sexy with Apache Airflow and how to sprinkle it with some free tools from Astronomer to make your life a bit easier.

🧑 Speaker: Bolke de Bruin
Bolke de Bruin is VP of Enterprise Data Services for Astronomer. He has a passion for data orchestration and how it can make organizations more agile. Before joining Astronomer in 2022, Bolke worked at ING where he built the company’s data analytics platform. Before that, he worked at the 2004 Summer and 2006 Winter Olympics, managing technology, communication, and data requirements for all news & media. Bolke is also the VP of Apache Airflow, the leading open-source data orchestration engine. In his spare time, Bolke is a guest lecturer at the University of Nyenrode, a fun father to Mattia and Timo, and can be found surfing, obstacle running, or taking in a museum when the opportunity arises.

🧑 Speaker: Jeff Fletcher
Jeff Fletcher is the Director of Field Engineering, Machine Learning at Astronomer, where he helps customers orchestrate machine learning pipelines as part of their Modern Data Orchestration platform. Previously he was a machine learning specialist at Cloudera. Before that, he led a product development team at Dimension Data, designed and implemented internet services at Verizon Business, and founded Antfarm, South Africa’s first streaming company. He has a side hobby doing dataviz and was shortlisted for an Information Is Beautiful award. Jeff holds a degree in electrical engineering from Wits and plays bass, and instructs synthesizers for One Might Atom.

Photo of PyData Amsterdam group
PyData Amsterdam
See more events