PyData Budapest Online #4 - Dataframe Evolution


Details
In this meetup we will cover the evolution of the solutions aiming to improve the well-known and beloved pandas dataframe, a key component of the Python data ecosystem.
Attention: External registration required, please see below!
If you ever wondered who you could make Python data processing faster, then it's time to meet new tools:
-
Modin: a light-weight parallel DataFrame using a column-store approach which scales up to 1TB+ datasets.
https://github.com/modin-project/modin -
Vaex.io: a library for memory-mapped, out-of-core processing of Python dataFrames, scaling up to a billion rows per second.
https://github.com/vaexio/vaex -
CuDF: a dataframe library for loading, joining, aggregating, filtering, and otherwise manipulating data using the power of the GPU.
https://github.com/rapidsai/cudf
Our speakers:
Devin Petersohn - Modin
PhD student at UC Berkeley and ML engineer @ Intel, creator of Modin
https://www.linkedin.com/in/devinpetersohn
Jovan Veljanoski - Vaex
Senior data scientist @ Cloud Technology Solutions and co-founder of Vaex.io
https://www.linkedin.com/in/jovanvel
Miguel Martínez - cuDF
Senior Deep Learning Data Scientist @ NVIDIA
https://www.linkedin.com/in/miguelusque
Schedule (all time CEST):
16:30 Optional pre-meetup intro training to pandas in Hungarian
18:00 Meetup starts
18:05 Welcome talk by Bence Arato and Terry Foor (NumFOCUS)
18:15 Modin talk
18:45 Vaex talk
19:15 cuDF talk
19:45 Q&A
Update: The pre-meeting intro training is full.
Registration/Tickets:
Our meetups are non-profit and free to attend, but this time we are raising funds for NumFOCUS, a non-profit that helps sustain Python's scientific computing and data ecosystem. If you are using Python professionally then please consider buying an optional supporting ticket. Details and registration:
https://www.cognitoforms.com/BIC1/PyDataBudapest4DataframeEvolution
For Q&A and general chat, please join the PyData Budapest slack community: https://bit.ly/pydata-budapest-slack
This is an English speaking event except the pre-meetup training session.

PyData Budapest Online #4 - Dataframe Evolution