OpenShift (vLLM) Meetup
Details
The OpenShift Meetup team is excited to invite you to the inaugural in-person vLLM meetup in Melbourne hosted by Red Hat at the Melbourne Red Hat office.
This is your chance to connect with a growing community of vLLM users, developers, maintainers, and engineers from Red Hat. We'll dive deep into technical talks, share insights, and discuss our journey in optimizing LLM inference for performance and efficiency.
This is an in-person event only.
What to expect:
Technical insights
Networking with industry experts
Hands-on learning & demos
Agenda
17:30-18:00: Registration and Opening Remarks
18:00-18:30: Turning GenAI Investments into Results: Why Inference Matters
18:30-19:00: AI Inference optimisation
19:00-19:30: Pizza and drinks
19:30-20:00: Intro to vLLM and it's techniques: Quantization, KV Cache, Paged-Attention, and Continuous Batching
20:00-20:30: vLLM Inference Demo – NVIDIA GPU Accelerator
20:30-21:00: Networking and drinks
