Open-Set Person Search with Gemini and SigLIP in Retail Environments


Details
Discover how to combine Google’s Gemini Multimodal API and SigLIP to build a powerful open-set person search system for real-world applications like retail and security. This talk walks through the foundations of Vision-Language Models (VLMs), explains how they work under the hood, and demonstrates a real fine-tuning pipeline that enables natural language queries like "a woman with a blue jacket near the entrance" to find relevant people in video footage, even when they were never seen during training. We’ll also explore dataset curation with FiftyOne and practical challenges in deploying these systems.
Come out to Atomic Robot's office Wednesday evening, have some pizza with like-minded tech aficionados, and enjoy this presentation from Google Developer Expert Adonai Vera!
Agenda
---
Speaker
Adonai Vera
Hosted By
Greg Williams, GDG Organizer
Patrick Hammond, GDG Organizer
---
Partner
Atomic Robot (https://atomicrobot.com)
Atomic Robot graciously provides us with event space and often sponsors food at our meetups. Thanks Atomic Robot!
---
Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cincinnati-presents-open-set-person-search-with-gemini-and-siglip-in-retail-environments/.

