Accelerating Data Processing with Apache DataFusion

Hosted By
Opeyemi

Details

Join us for a focused, hands-on training session on Apache DataFusion, a modern, extensible query engine written in Rust. This short session gives data engineers, analysts, and developers a practical introduction to using DataFusion for high-performance, in-memory query processing on structured data.
In this session, you'll learn:

  • What Apache DataFusion is and how it fits into the modern data stack
  • Core concepts: logical plans, physical plans, and execution contexts
  • How to query data using SQL and DataFrames
  • Practical use cases including CSV, Parquet, and in-memory tables
  • Extending DataFusion with custom functions and data sources

Who Should Attend:
This training is ideal for engineers and architects working with large-scale data processing pipelines, especially those exploring Rust-based solutions or looking to integrate DataFusion into their existing infrastructure.
Format:
The session will combine presentations with live coding. Attendees will leave with working examples and the knowledge to begin experimenting with DataFusion in their own environments.

York Database Internals Meetup Group
FREE