

What we’re about
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other.
The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
The PyData Code of Conduct governs this meetup. To discuss any issues or concerns relating to the code of conduct or the behavior of anyone at a PyData meetup, please contact NumFOCUS Executive Director Leah Silen (+1 512-222-5449; [leah@numfocus.org](mailto:leah@numfocus.org)) or the group organizer.
We run monthly meetups at changing locations and have organized six conferences, in 2014, 2015, 2016, 2017, 2019, and 2022. You can see our latest meetups, submit a talk idea and read PyData blog posts on our site : https://berlin.pydata.org.
Please get in touch using info@pydata.berlin.
Twitter: @pydataberlin
Upcoming events (3)
See all- PyData Berlin 2025 March MeetupHussitenstraße 32, Berlin
Welcome to the PyData Berlin March meetup!
We would like to welcome you all starting from 18:45. There will be food and drinks. The talks begin around 19.30 and the doors will close at 19:30. Make sure to arrive on time!
*** Important!! ***
Please keep in mind that there is a BVG strike on this day, affecting U-Bahn, trams, and buses. S-Bahn and regional trains will work.Please provide your first and last name for the registration because this is required for the venue's entry policy. If you cannot attend, please cancel your spot so others are able to join as the space is limited.
Host:
Bonial is excited to welcome you to this month's version of PyData.
**************************************************************************
The Lineup for the eveningTalk 1: Extract structured product & deal information from PDFs on scale via LLM
Abstract: Bonial shows hundreds of thousands of offers from local brick-and-mortar retailers on its platform, a subset of this content is retrieved from PDF files. In this talk I’ll explain how we leverage LLM to parse unstructured PDF files to create content on our platform.Speaker: Philipp Johannis has been part of Bonial for 12 years. He established and leads the Data Department, which consists of multiple Analytics, Engineering & Data Science teams, and is currently serving as Head of Data. He focuses on improving the data platform and enabling and supporting the development of various data driven products such as personalisation and traffic management.
Talk 2: Airweave, an Open-Source Tool To Turn Any App Into Accessible Agent Knowledge
Abstract: The talk will be an introduction to Airweave, which is an open-source Python tool that helps agent developers turn app data into accessible knowledge for AI agents. It connects to any app, database, URL, or API and structures the data for retrieval. Airweave automates authentication, ingestion, enrichment, mapping, and syncing to vector stores and graph databases of choice. It has a search layer for agents out-of-the-box and allows extension of the platform with minimal code. Developers can use Airweave via our web UI, REST API, or SDKs.Speakers: Lennert Jansen and Rauf Akdemir are the creators of Airweave AI. Lennert is an AI Engineer & Researcher with a background in Applied Statistics and Deep Learning for NLP. Before Airweave, he worked on AI & Bayesian Statistics at Amazon, IBM, and the University of Amsterdam. Rauf is a CS graduate from Technical University of Delft, with strong engineering experience in productionising ML & data infrastructure in both start-ups and enterprise.
Lightning talks
There will be slots for 2-3 Lightning Talks (3-5 Minutes for each).
Kindly let us know if you would like to present something at the start of the meetup :)***
NumFOCUS Code of Conduct
THE SHORT VERSION
Be kind to others. Do not insult or put down others. Behave professionally. Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for NumFOCUS.
All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery are not appropriate.
NumFOCUS is dedicated to providing a harassment-free community for everyone, regardless of gender, sexual orientation, gender identity, and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of community members in any form.
Thank you for helping make this a welcoming, friendly community for all.
If you haven't yet, please read the detailed version here: https://numfocus.org/code-of-conduct
*** - Network event83 attendees from 134 groups hostingPyData Virginia 2025Violet Crown Charlottesville, Charlottesville, VA
Mark your calendars for PyData Virginia 2025, coming to Charlottesville on April 18-19! This two-day, in-person event at the Violet Crown Cinema is the ultimate place for data scientists, engineers, and developers to swap ideas and learn new info.
Want to share your ideas?
Submit your talk proposal by Feb. 10!Snag your tickets before they're gone!
Buy Here - Network event231 attendees from 135 groups hostingPyData London 2025Needs location
Get ready to unleash your inner data aficionado at PyData London 2025, happening June 6-8 at Convene Sancroft, St. Paul’s! This three-day, in-person event is your golden ticket to dive into live keynotes, talks, and lightning sessions alongside fellow data enthusiasts.
Have an idea you want to share? Submit your talk proposal by Feb. 24. Tickets sold out in 2024, so don’t wait—grab yours today! Buy Here