
Apache Arrow - Exploring the tech that powers the modern data (science) stack

Hosted by Milen Chechev

Details

As data scientists and engineers working in Python, we focus on solving problems that involve large amounts of data. While many problems can be solved with a single tool, a combination of tools often solves them more efficiently. In most modern data tools, Apache Arrow is the backbone that enables lightning-fast data interchange between them. One of its best-known features is the Parquet loader, which is used in many dataframe libraries.

In this talk, we want to dive into the idea and impact of Apache Arrow and show which use cases it has enabled. Furthermore, we want to explain how it works technically and how one can use the pyarrow library in various ways to ensure efficient data interchange.

Program for the talk:
18:00 - Networking, Pizza and Beer
19:00 - Talk by Uwe
20:00 - Networking part 2

PyData Sofia
Entract 127
Old City Center, ulitsa „Georgi S. Rakovski“ 127 · Sofia