What we’re about
With the wide adoption of Kubernetes and cloud native deployments, fleet management of multiple clusters in a single cloud as well as in multi-cloud environments is an ongoing challenge for enterprises. This meetup aims to bring together cluster operators, platform architects, and financial stakeholders in the multi-cluster space to trade notes on problems, solutions, and best practices. Join us if you are an active multi-cluster practitioner or curious about the space!
Upcoming events (1)
- Production GenAI with RAG on Multi-cluster Cloud Kubernetes
  Databricks Inc., San Francisco, CA
Co-hosted by Kobie Crawford at MosaicML/Databricks.
GenAI models with RAG have demonstrated high-quality results for a variety of use cases. Companies putting such models into production are finding that self-hosting them offers control, privacy, performance, and cost advantages, but requires effective infrastructure management. Kubernetes clusters support orchestration and management across cloud-based computing resources and can provide a flexible platform for hosting production GenAI models with RAG. Join us to learn how to use a resource-aware, policy-based approach across multiple cloud K8s clusters to handle production hosting of a set of GenAI models with RAG.
5:30-6:00pm: Mingle, food, drinks
6:00-6:10pm: Welcome message from Madhuri and Kobie Crawford (MosaicML/Databricks)
6:10-6:40pm: Resource-Aware Scheduling for Production GenAI with RAG running on Multi-cluster Cloud Kubernetes - Anne Holler (Elotl), David Southwell (DataStax)
6:40-7:10pm: Talk 2 - Ajay Saini (MosaicML/Databricks)