Introduction to Apache Arrow and The Gandiva Initiative


Details
Location: https://g.co/kgs/WJBXEB
Title: Introduction to Apache Arrow and The Gandiva Initiative
Speaker: Ravindra Pindikura
Apache Arrow is designed to make things faster. It’s focused on speeding communication between systems as well as processing within any one system.
In this talk, Ravindra will start by discussing what Arrow is and why it was built. This will include covering an overview of the key components, goals, vision and current state. Ravindra will then take the audience through a detailed engineering review of how we used Arrow to solve several problems when building the Apache-Licensed Dremio product.
The Gandiva Initiative was recently released as a new engine for evaluating expressions on Arrow buffers using LLVM JIT compilation. In this talk Ravindra will talk through the goals of Gandiva, how it was implemented, and show some examples of massive performance improvements we are already seeing from this initiative.
https://github.com/dremio/gandiva
This will be a highly technical talk targeted towards people building data infrastructure systems and complex workflows.

Introduction to Apache Arrow and The Gandiva Initiative