Exploring Kaito to streamline AI inference model deployment in Azure Kubernetes

Hosted By
Alixia B.

Details

About this session:
Roy Kim will present Kaito, a Kubernetes operator that streamlines the deployment of AI/ML inference models. Discover how Kaito simplifies deploying large open-source inference models such as Falcon and Llama 2. Learn about its distinctive features: packaging large model files in container images, preset GPU configurations, automatic provisioning of GPU nodes, and hosting model images on the Microsoft Container Registry (MCR). See how Kaito simplifies the workflow of onboarding large AI inference models in Kubernetes.
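To give a sense of the workflow the session covers, here is a minimal sketch of a Kaito `Workspace` custom resource that requests a GPU node and deploys a preset Falcon model. The instance type, label, and preset name shown are illustrative assumptions; consult the Kaito documentation for the presets and VM sizes supported in your cluster.

```yaml
# Hypothetical example: deploy a falcon-7b inference preset with Kaito.
# Kaito reads this Workspace, auto-provisions a GPU node of the requested
# instance type, and runs the preset model image from MCR on it.
apiVersion: kaito.sh/v1alpha1
kind: Workspace
metadata:
  name: workspace-falcon-7b        # assumed name for illustration
resource:
  instanceType: "Standard_NC12s_v3" # assumed Azure GPU VM size
  labelSelector:
    matchLabels:
      apps: falcon-7b
inference:
  preset:
    name: "falcon-7b"               # one of Kaito's preset model configurations
```

Applying this manifest with `kubectl apply -f workspace.yaml` is, in broad strokes, all that onboarding the model requires; Kaito handles node provisioning and model serving behind the scenes.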

Learn more and develop your skills in Azure Kubernetes Service with this Microsoft Learn training module:
https://aka.ms/IntroToAKSLearn3

Microsoft Reactor Toronto