PyData Berlin 2025 March Meetup

Hosted By
Carolina S. and 4 others

Details

Welcome to the PyData Berlin March meetup!

Doors open at 18:45, and there will be food and drinks. The talks begin around 19:30, and the doors close at 19:30, so make sure to arrive on time!

*** Important!! ***
Please keep in mind that there is a BVG strike on this day, affecting U-Bahn, trams, and buses. S-Bahn and regional trains will run.

Please register with your first and last name, as this is required by the venue's entry policy. If you cannot attend, please cancel your spot so that others can join, as space is limited.

Host:
Bonial is excited to welcome you to this month's edition of PyData.
**************************************************************************
The Lineup for the evening

Talk 1: Extract structured product & deal information from PDFs on scale via LLM
Abstract: Bonial shows hundreds of thousands of offers from local brick-and-mortar retailers on its platform; a subset of this content is retrieved from PDF files. In this talk I’ll explain how we leverage LLMs to parse unstructured PDF files and create content for our platform.

Speaker: Philipp Johannis has been part of Bonial for 12 years. He established and leads the Data Department, which consists of multiple Analytics, Engineering & Data Science teams, and is currently serving as Head of Data. He focuses on improving the data platform and on enabling and supporting the development of various data-driven products such as personalisation and traffic management.

Talk 2: Airweave, an Open-Source Tool To Turn Any App Into Accessible Agent Knowledge
Abstract: The talk will be an introduction to Airweave, which is an open-source Python tool that helps agent developers turn app data into accessible knowledge for AI agents. It connects to any app, database, URL, or API and structures the data for retrieval. Airweave automates authentication, ingestion, enrichment, mapping, and syncing to vector stores and graph databases of choice. It has a search layer for agents out-of-the-box and allows extension of the platform with minimal code. Developers can use Airweave via our web UI, REST API, or SDKs.

Speakers: Lennert Jansen and Rauf Akdemir are the creators of Airweave AI. Lennert is an AI Engineer & Researcher with a background in Applied Statistics and Deep Learning for NLP. Before Airweave, he worked on AI & Bayesian Statistics at Amazon, IBM, and the University of Amsterdam. Rauf is a CS graduate from Technical University of Delft, with strong engineering experience in productionising ML & data infrastructure in both start-ups and enterprise.

Lightning talks
There will be slots for 2-3 Lightning Talks (3-5 minutes each).
If you would like to present something, kindly let us know at the start of the meetup :)

***
NumFOCUS Code of Conduct
THE SHORT VERSION
Be kind to others. Do not insult or put down others. Behave professionally. Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for NumFOCUS.
All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery are not appropriate.
NumFOCUS is dedicated to providing a harassment-free community for everyone, regardless of gender, sexual orientation, gender identity and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of community members in any form.
Thank you for helping make this a welcoming, friendly community for all.
If you haven't yet, please read the detailed version here: https://numfocus.org/code-of-conduct
***

PyData Berlin
Hussitenstraße 32 · Berlin, BE