
DIY LLMs: Hosting your own LLM inference, from silicon to service

Hosted By
Aline A.

Details

Join us for the USF Data Science Speaker Series featuring Dr. Charles Frye, Developer Advocate at Modal!

Charles Frye is a passionate educator who specializes in teaching people to build AI applications. After publishing research in psychopharmacology and neurobiology, he earned his Ph.D. at the University of California, Berkeley, with dissertation work on neural network optimization. He has taught thousands of people across the full stack of AI application development, from foundational linear algebra to advanced GPU techniques and building defensible AI-driven businesses.

Charles will explore the essential components for running your own large language model (LLM) inference service. The talk will cover:
• Compute options: CPUs, GPUs, TPUs, and LPUs.
• Model options: Qwen, LLaMA, and others.
• Inference server options: TensorRT-LLM, vLLM, and SGLang.
• Observability tools: OTel stack, LangSmith, W&B Weave, and Braintrust.
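To give a concrete flavor of the "service" end of the stack: inference servers such as vLLM and SGLang expose an OpenAI-compatible HTTP API, so a self-hosted model can be queried with nothing but the Python standard library. The sketch below is illustrative only — the endpoint URL, port, and model name are example values, not part of the talk.

```python
import json

# vLLM and SGLang both serve an OpenAI-compatible chat-completions API.
# Example endpoint for a locally running server (assumed, not prescribed):
VLLM_ENDPOINT = "http://localhost:8000/v1/chat/completions"


def build_chat_request(model: str, prompt: str, max_tokens: int = 128) -> bytes:
    """Build the JSON body for an OpenAI-style chat completion request."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }
    return json.dumps(payload).encode("utf-8")


# Against a running server (e.g. one started with `vllm serve <model>`),
# the body can be POSTed with urllib:
#
#   import urllib.request
#   req = urllib.request.Request(
#       VLLM_ENDPOINT,
#       data=build_chat_request("Qwen/Qwen2.5-7B-Instruct", "Hello!"),
#       headers={"Content-Type": "application/json"},
#   )
#   with urllib.request.urlopen(req) as resp:
#       print(json.load(resp)["choices"][0]["message"]["content"])
```

Because the wire format is the same across these servers, swapping vLLM for SGLang (or a hosted provider) is largely a matter of changing the endpoint URL.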

Don’t miss this opportunity to gain practical knowledge on building and hosting your own LLM services from a leading AI educator and expert!

#USFDataScienceSpeakerSeries #DataScience #MSDS #LLMs #AI #MachineLearning #AIApplications

The University of San Francisco Data Science Speaker Series
101 Howard St, University of San Francisco - Downtown Campus, San Francisco, CA 94105