London First inference & vLLM meetup (Gen-AI)
Details
Join us for the first inference & vLLM technical meetup in London, bringing together AI practitioners, infrastructure and inference experts, as well as companies using vLLM in production.
Whether you're experimenting with vLLM or running large-scale inference workloads, this event is for you. Expect hands-on insights, real-world feedback, and open discussions with others working on optimizing inference at scale.
Location: London, UK
Time: 6:30 PM - 10:00 PM
Format: In-person
Agenda:
- 6:30 - 7:00 PM: Welcome
- 7:00 - 8:30 PM: Talks
  - Exxa - Etienne Balit (co-founder & CTO): intro to vLLM & deep-dive on speculative decoding
  - Hiverge - Alhussein Fawzi (co-founder & CEO): topic to be announced soon
  - Doubleword - Jamie Dborin (co-founder): batched inference
- 8:30 - 10:00 PM: Open networking, drinks, and pizza
We'll discuss performance optimizations, scaling strategies, hardware compatibility, and more.
Do you want to become a speaker?
We're looking for speakers to share their technical experience with inference & vLLM. If you're interested, please fill out this form: Link
Who should come?
ML engineers, infra & DevOps teams, AI founders, and anyone working on inference or using or evaluating vLLM in their stack.
Free registration. Spots are limited.