RAG with vision models

Name: RAG with vision models
Start: 2024-09-09T16:00:00-04:00
End: 2024-09-09T17:00:00-04:00

Hosted by Microsoft R.

Microsoft Reactor New York

Details

RAG (Retrieval Augmented Generation) is a way to get LLMs to answer questions grounded in a particular knowledge base. What do you do when your knowledge base includes images, like graphs or photos? You first need to generate embeddings using a multimodal model, like the one available from Azure Computer Vision, search those embeddings using a powerful vector search like Azure AI Search, and then send any retrieved text and images to a multimodal LLM like GPT-4o. Learn how to get started quickly with a RAG on multimodal documents in this session.

Presented by Pamela Fox, Python Advocate at Microsoft

** Part of RAGHack, a free global hackathon to developer RAG applications. Join at https://aka.ms/raghack **

📌 Check out the RAGHack 2024 series here!

Pre-requisites:

Read the official rules and join the hack at https://aka.ms/raghack. No Purchase Necessary. Must be 18+ to enter. Contest ends 9/16/24.
Want more hands-on RAG training? Visit the Reactor series home page to see all the RAGHack 2024 sessions!

Microsoft Reactor New York

Microsoft Reactor YouTube

Microsoft Learn AI Hub

Microsoft Copilot Hub

Microsoft Reactor LinkedIn

RAG with vision models

Microsoft Reactor New York

Details

Sponsors

Microsoft Reactor YouTube

Microsoft Learn AI Hub

Microsoft Copilot Hub

Microsoft Reactor LinkedIn

You may also like