vllm - memory management

Name: vllm - memory management
Start: 2025-10-30T18:30:00-07:00
End: 2025-10-30T20:00:00-07:00

Hosted by Jerry K. and 3 others

Meet the group

West Coast Machine Learning Meetup (aka East Bay/Trivalley)

Details

Tonight we will begin a two-part review of vLLM, starting with memory management. The paper is [2309.06180] Efficient Memory Management for Large Language Model Serving with PagedAttention: https://share.google/JoXvHfNA7TSWtMAri

Every week we meet (virtually) and discuss the most interesting topics in AI, ML and Deep Learning.
We usually alternate weeks: every other week a member presents a deep learning/machine learning paper, frequently paired with a video explaining the concepts. In the off weeks, we present information about different projects. We cover computer vision, language (LLMs), health/hard science, and generative models (diffusion, GANs, etc.).
People of all skill levels are welcome, from newcomers to machine learning to PhDs in DL/ML/AI.
We have been meeting weekly for over 6 years and have developed a good pace and community. Come join us and stay abreast of the biggest topics in artificial intelligence.
