Skip to content

Details

RAG (Retrieval Augmented Generation) is a way to get LLMs to answer questions grounded in a particular knowledge base. What do you do when your knowledge base includes images, like graphs or photos? You first need to generate embeddings using a multimodal model, like the one available from Azure Computer Vision, search those embeddings using a powerful vector search like Azure AI Search, and then send any retrieved text and images to a multimodal LLM like GPT-4o. Learn how to get started quickly with a RAG on multimodal documents in this session.

Presented by Pamela Fox, Python Advocate at Microsoft

** Part of RAGHack, a free global hackathon to developer RAG applications. Join at https://aka.ms/raghack **

📌 Check out the RAGHack 2024 series here!

Pre-requisites:

  • Read the official rules and join the hack at https://aka.ms/raghack. No Purchase Necessary. Must be 18+ to enter. Contest ends 9/16/24.

  • Want more hands-on RAG training? Visit the Reactor series home page to see all the RAGHack 2024 sessions!

Sponsors

Microsoft Reactor YouTube

Microsoft Reactor YouTube

Watch past Microsoft Reactor events on-demand anytime

Microsoft Learn AI Hub

Microsoft Learn AI Hub

Learning hub for all things AI

Microsoft Copilot Hub

Microsoft Copilot Hub

Learning hub for all things Copilot

Microsoft Reactor LinkedIn

Microsoft Reactor LinkedIn

Follow Microsoft Reactor on LinkedIn

You may also like