Chat Against Private Documents with KAITO RAG Engine on Azure Kubernetes
Details
KAITO RAG Engine on Azure Kubernetes Service is a Kubernetes-native way to run a full Retrieval-Augmented Generation (RAG) backend fully hosted inside your own Azure environment.
I will demo a real RAG system on Azure Kubernetes Service (AKS) using the KAITO RAG Engine - https://github.com/kaito-project/kaito
You’ll see how to:
- High level architecture and configuration with KAITO RAG Engine on AKS
- Understand the RAG architecture (ingestion, embedding, retrieval, inference)
- Ingest and index real documents into the RAG Engine
- Connect the system to a Streamlit-based chatbot UI
- Query your own data using an LLM—live and in real time
The session is practical, code-driven, and demo-heavy, with a focus on how these components fit together in a cloud-native, scalable architecture.
Whether you’re a cloud engineer, platform engineer, or AI developer, you’ll walk away with a clear mental model—and a working reference of a RAG system on Kubernetes.




