What we’re about
PyData is an educational program of NumFOCUS, a 501(c)3 non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other.
The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.
The PyData Code of Conduct governs this meetup. To discuss any issues or concerns relating to the code of conduct or the behavior of anyone at a PyData meetup, please contact NumFOCUS Executive Director Leah Silen (+1 512-222-5449; leah@numfocus.org) or the group organizer.
We run monthly meetups at changing locations and have organized six conferences, in 2014, 2015, 2016, 2017, 2019, and 2022. You can see our latest meetups, submit a talk idea and read PyData blog posts on our site : https://berlin.pydata.org.
Please get in touch using info@pydata.berlin.
Twitter: @pydataberlin
Upcoming events (1)
See all- PyData Berlin 2024 June MeetupSpiced Academy, Berlin, BE
Welcome to the PyData Berlin June meetup!
We would like to welcome you all starting from 18:30. There will be food and drinks. The talks begin around 19.00.
Please provide your first and last name for the registration because this is required for the venue's entry policy. If you cannot attend, please cancel your spot so others are able to join as the space is limited.
Host:
SPICED Academy is excited to welcome you for this month's version of PyData. At SPICED Academy, we connect the next tech success stories to their futures. Our intensive bootcamps in Data Science, Data Analytics, UX/UI, Java Development and Web Development teach the most up-to-date, in-demand skills on the market.
**************************************************************************The Lineup for the evening
Talk 1: FormatAI: Structuring Unstructured Text using generative AI
Abstract: In the world of Natural Language Processing (NLP), the task of structuring unstructured text is akin to a translation task from free text into standardized formats like SQL or JSON. Generative AI is a great tool for this task, however, transformer-based translation models, such as GPT, sometimes struggle to adhere to the desired format, and to support the variety of expression of a natural language.
In this session as we delve into the intricacies of using transformer-based models to extract information from natural language to a predefined structure. We'll explore effective best practices, including the Constrained Generative AI, a technique that can entirely eliminate syntactic errors. Additionally, we will journey through preprocessing, data generation and augmentation methods and training strategies, showcasing how these methodologies helped us achieve near-perfect accuracy in extracting dates and times within textual data.
By the end of the session, you will gain insights into harnessing generative AI for structured output, opening doors to more accurate and efficient NLP applications.
Speaker: Oren Matar is a principal data scientist and algorithms developer, working at Anaplan, with a background in social sciences and Bayesian methods. I’m specializing in NLP and time series forecasting particularly in the field of supply chain forecasting.Talk 2: Probably Fun - games to teach statistics
Abstract: I am currently field-testing gamified lessons for my statistics classes. "gamified" means playing with cards, dice, puzzles and more in a face-to-face classroom and then analyze the underlying statistical concepts. The games are a great way to build intrinsic motivation in students, promote understanding and foster cooperation among groups.
Speaker: Kristian Rother is a freelance Python trainer who wrote his first lines of Python in the year `#07cf`. After a career writing software for life science research, he has been teaching Python, Data Analysis and Machine Learning throughout Europe since 2011. More recently, he has built data pipelines for the real estate and medical sector. Kristian has translated 5 Python books and written 2 more himself, in addition to numerous teaching guides. Kristian has collected 308 stars on Advent of Code. His knowledge about async is, unfortunately, miserable. His favorite Python module is `re`. Kristian believes everybody can learn programming. You can find Kristian's teaching materials on https://www.academis.euLightning talks
There will be slots for 2-3 Lightning Talks (3-5 Minutes for each).
Kindly let us know if you would like to present something at the start of the meetup :)***
NumFOCUS Code of Conduct
THE SHORT VERSION
Be kind to others. Do not insult or put down others. Behave professionally. Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for NumFOCUS.
All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery are not appropriate.
NumFOCUS is dedicated to providing a harassment-free community for everyone, regardless of gender, sexual orientation, gender identity, and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of community members in any form.
Thank you for helping make this a welcoming, friendly community for all.
If you haven't yet, please read the detailed version here: https://numfocus.org/code-of-conduct
***