Skip to content

London vLLM & inference technical meetup cover photo

London vLLM & inference technical meetup

9 members · Public group

Share

What we’re about

Join us for the first inference & vLLM technical meetup in London, bringing together AI practitioners, infrastructure and inference experts, as well as companies using vLLM in production.

Whether you're experimenting with vLLM or running large-scale inference workloads, this event is for you. Expect hands-on insights, real-world feedback, and open discussions with others working on optimizing inference at scale.

📍 Location: London, UK
🕖 Time: 6:30PM – 10:00PM
💬 Format: In-person

Agenda:
6:30 – 7:00 PM: Welcome
7:00 – 8:30 PM: Talks

Exxa - Etienne Balit (co-founder & CTO): intro to vLLM & deep-dive on speculative decoding
Hiverge - Alhussein Fawzi (co-founder & CEO): topic to be announced soon
Doubleword - Jamie Dborin (co-founder): batched inference
8:30 – 10 PM: Open networking & drinks + pizzas

We’ll discuss performance optimizations, scaling strategies, hardware compatibility, and more.

🎯 Who should come?ML engineers, infra & DevOps teams, AI founders, and anyone working on inference, using or evaluating vLLM in their stack.

Upcoming events

1

Mon, Oct 20 · 6:30 PM BST
London First inference & vLLM meetup (Gen-AI)
The Loading Bay - at Techspace, 25 Luke St, EC2A 4DS, London, GB
Join us for the first inference & vLLM technical meetup in London, bringing together AI practitioners, infrastructure and inference experts, as well as companies using vLLM in production.

Whether you're experimenting with vLLM or running large-scale inference workloads, this event is for you. Expect hands-on insights, real-world feedback, and open discussions with others working on optimizing inference at scale.

📍 Location: London, UK
🕖 Time: 6:30PM – 10:00PM
💬 Format: In-person

Agenda:

6:30 – 7:00 PM: Welcome

7:00 – 8:30 PM: Talks

Exxa - Etienne Balit (co-founder & CTO): intro to vLLM & deep-dive on speculative decoding

Hiverge - Alhussein Fawzi (co-founder & CEO): topic to be announced soon

Doubleword - Jamie Dborin (co-founder): batched inference

8:30 – 10 PM: Open networking & drinks + pizzas

We’ll discuss performance optimizations, scaling strategies, hardware compatibility, and more.
🎙️Do you want to become a speaker?
We're looking for speakers to share their technical experience with inference & vLLM. If you're interested, please fill this form 👉 Link

🎯 Who should come?
ML engineers, infra & DevOps teams, AI founders, and anyone working on inference, using or evaluating vLLM in their stack.
🎟️ Free registration – spots are limited
7 attendees

Organizers

Members

9

Related topics

IT Infrastructure

Machine Intelligence

Artificial Intelligence