PyTorch ATX: The Future of Inferencing

Hosted By
Jason M.

Details
Join PyTorch ATX this September for a hands-on look at the next generation of AI inference pipelines. We’ll explore the full modern stack, from aggressive model-size reductions like INT4/INT8 quantization and pruning to dynamic batching, paged-attention memory tricks, and multi-node scheduling. We’ll dive into vLLM, one of today’s most popular open-source engines for high-throughput LLM inference, alongside other cutting-edge inference stacks.
Expect deeply technical talks, live demos, and open Q&A with the engineers building and running these systems.
When: September 17, 2025
Where: Voltron Room - Capital Factory (1st Floor) in Austin, TX
Snacks and beverages will be provided.

Capital Factory, Voltron Room (1st floor)
701 Brazos Street · Austin, TX
FREE
250 spots left