PyData 2022 November Edition


Details
Welcome to the November Pydata Berlin edition!!
For everybody to feel safer, we recommend you test yourself against COVID-19 before coming to the event. A self-test or a rapid antigen test would suffice. And please refrain from coming to the event if you feel unwell.
Doors open at 18:45 and food will be served at 19:00. Please be on time!
***
Talks:
Paolo Tamagnini
Reliable and Reusable Python Code Sharing With KNIME
KNIME Analytics Platform is a no-code/low-code open source and totally free desktop software. Users can drag and drop nodes on a canvas to build their custom data analysis. With the release of KNIME 4.6 (June 2022) devs can now build new nodes for the KNIME community also in Python, not just Java. This talk is a tutorial to teach the Python community of devs on how to make their library more accessible to no-code/low-code users by developing KNIME nodes in pure Python. The node can be shared then in the open source KNIME ecosystem or privately within an organization.
Bio
Paolo Tamagnini is a senior data scientist at KNIME. After graduating with a master’s degree in data science at Sapienza University of Rome, Paolo gathered research experience at New York University in machine learning interpretability and visual analytics tools. Since working at KNIME, Paolo has presented different workshops in the USA and Europe, and led the KNIME verified component project: a monthly release of reusable low-code/no-code applications for different data science techniques (autoML, XAI, Time Series, NLP, ..) and verticals (finance, marketing, supply chain, ..).
Break
Arne Tarara
Energy measurement and estimation of Cloud Infrastructure and Workloads
Energy cost estimation for cloud workloads is an emerging topic that surfaces especially in the advent of the current energy crisis. We will present techniques to estimate the energy cost that have been developed so far and an open source approach using the SPECPower dataset. Also, we will be discussing measurement techniques if available with open source tools like CodeCarbon for ML Models and Intel RAPL for CPU-bound workloads.
Bio
Arne Tarara works for Green Coding Berlin, which is a Berlin-based software company focused on creating open-source and measurements in the domain of software energy consumption. He has been working as a software developer for the last 16 years mainly in the web domain with a strong background in analytics and linear modeling.
NumFOCUS Code of Conduct
THE SHORT VERSION
Be kind to others. Do not insult or put down others. Behave professionally. Remember that harassment and sexist, racist, or exclusionary jokes are not appropriate for NumFOCUS.
All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery are not appropriate.
NumFOCUS is dedicated to providing a harassment-free community for everyone, regardless of gender, sexual orientation, gender identity, and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of community members in any form.
Thank you for helping make this a welcoming, friendly community for all.
If you haven't yet, please read the detailed version here: https://numfocus.org/code-of-conduct
***
SPONSORS:
CODE is a Berlin-based state-accredited university of applied sciences that offers Bachelor’s degree programs in the field of Digital Product Development.
COVID-19 safety measures

PyData 2022 November Edition