Accelerating Data Processing with Apache DataFusion


Details
Join us for a focused, hands-on training session on Apache DataFusion, the modern, extensible query execution engine built in Rust. This short training is designed to give data engineers, analysts, and developers a practical introduction to leveraging DataFusion for high-performance, in-memory query processing on structured data.
In this session, you'll learn:
- What Apache DataFusion is and how it fits into the modern data stack
- Core concepts: logical plans, physical plans, and execution contexts
- How to query data using SQL and DataFrames
- Practical use cases including CSV, Parquet, and in-memory tables
- Extending DataFusion with custom functions and datasources
Who Should Attend:
This training is ideal for engineers and architects working with large-scale data processing pipelines, especially those exploring Rust-based solutions or looking to integrate DataFusion into their existing infrastructure.
Format:
The session will include a mix of presentations and live coding. Attendees will leave with working examples and the knowledge to begin experimenting with DataFusion in their environments.

Accelerating Data Processing with Apache DataFusion