Accelerating Data Processing with Apache DataFusion

Name: Accelerating Data Processing with Apache DataFusion
Start: 2025-07-31T18:00:00+01:00
End: 2025-07-31T19:00:00+01:00

Hosted by Opeyemi

York Database Internals Meetup Group

Details

Join us for a focused, hands-on training session on Apache DataFusion, the modern, extensible query execution engine built in Rust. This short training is designed to give data engineers, analysts, and developers a practical introduction to leveraging DataFusion for high-performance, in-memory query processing on structured data.
In this session, you'll learn:

What Apache DataFusion is and how it fits into the modern data stack
Core concepts: logical plans, physical plans, and execution contexts
How to query data using SQL and DataFrames
Practical use cases including CSV, Parquet, and in-memory tables
Extending DataFusion with custom functions and datasources

Who Should Attend:
This training is ideal for engineers and architects working with large-scale data processing pipelines, especially those exploring Rust-based solutions or looking to integrate DataFusion into their existing infrastructure.
Format:
The session will include a mix of presentations and live coding. Attendees will leave with working examples and the knowledge to begin experimenting with DataFusion in their environments.

Accelerating Data Processing with Apache DataFusion

York Database Internals Meetup Group

Details

You may also like