
What we’re about
This meetup is focused on AI Performance Engineering.
Upcoming events (4)
See all- NVIDIA Dynamo + Disaggregated Prefill-Decode LLM Serving + CUDA OptimizationsLink visible for attendees
Zoom link: https://us02web.zoom.us/j/82308186562
Talk #0: Introductions and Meetup Updates
by Chris Fregly and Antje BarthTalk #1: NVIDIA Dynamo + Disaggregated Prefill-Decode LLM Serving by Chris Alexiuk @ NVIDIA
NVIDIA Dynamo splits LLM serving into disaggregated prefill and decode stages, letting each scale independently for better throughput under latency constraints. We'll dive deep into how Dynamo does disaggregated serving in this session.Talk #2: High Performance CUDA Optimizations by Chris Fregly and Others
CUDA Optimizations for high-performance AI.Zoom link: https://us02web.zoom.us/j/82308186562
Related Links
Github Repo: http://github.com/cfregly/ai-performance-engineering/
O'Reilly Book: https://www.amazon.com/Systems-Performance-Engineering-Optimizing-Algorithms/dp/B0F47689K8/
YouTube: https://www.youtube.com/@AIPerformanceEngineering
Generative AI Free Course on DeepLearning.ai: https://bit.ly/gllm - The AI Conference 2025 (In-Person @ Pier 48 Mission Bay, Sept 17-18 15% Off)Needs location
RSVP with 15% discount code using Fregly25 at https://aiconference.com/#tickets
Join us for the most anticipated AI event of the year and share two days with the brightest minds in AI.
The AI Conference 2025 is an in-person event scheduled for Wednesday, September 17th and Thursday, September 18th at Pier 48 in Mission Bay, San Francisco. Your admission covers both knowledge-filled days with amazing opportunities to learn, connect and build with this vibrant community.- 2 Days | 100+ Speakers | 4 Tracks
- 85+ Top AI Companies Exhibiting
- The newest agentic, robotic, and frontier AI technology
- Deep industry-specific talks on Applied AI
- Live AI-tech Startup Competition with VC Judges
- Meaningful Networking & 6 Months App Access
- AI After Dark Networking Mixer + Expo Booth Crawl
- The AI Conference Hack Day
RSVP with 15% discount code using Fregly25 at https://aiconference.com/#tickets
- GPU, CUDA, and PyTorch Performance OptimizationsLink visible for attendees
Zoom link: https://us02web.zoom.us/j/82308186562
Talk #0: Introductions and Meetup Updates
by Chris Fregly and Antje BarthTalk #1: GPU, PyTorch, and CUDA Performance Optimizations
Talk #2: GPU, PyTorch, and CUDA Performance Optimizations
Zoom link: https://us02web.zoom.us/j/82308186562
Related Links
Github Repo: http://github.com/cfregly/ai-performance-engineering/
O'Reilly Book: https://www.amazon.com/Systems-Performance-Engineering-Optimizing-Algorithms/dp/B0F47689K8/
YouTube: https://www.youtube.com/@AIPerformanceEngineering
Generative AI Free Course on DeepLearning.ai: https://bit.ly/gllm - GPU, CUDA, and PyTorch Performance OptimizationsLink visible for attendees
Zoom link: https://us02web.zoom.us/j/82308186562
Talk #0: Introductions and Meetup Updates
by Chris Fregly and Antje BarthTalk #1: GPU, PyTorch, and CUDA Performance Optimizations
Talk #2: GPU, PyTorch, and CUDA Performance Optimizations
Zoom link: https://us02web.zoom.us/j/82308186562
Related Links
Github Repo: http://github.com/cfregly/ai-performance-engineering/
O'Reilly Book: https://www.amazon.com/Systems-Performance-Engineering-Optimizing-Algorithms/dp/B0F47689K8/
YouTube: https://www.youtube.com/@AIPerformanceEngineering
Generative AI Free Course on DeepLearning.ai: https://bit.ly/gllm