Skip to content

Vision-Language Club - Advancing Multimodal Agents and Physical AI at NVIDIA

Photo of Dana Arad
Hosted By
Dana A. and 2 others
Vision-Language Club - Advancing Multimodal Agents and Physical AI at NVIDIA

Details

For the next meeting of the Vision-Language Club we are very excited to host Lior Cohen for a talk on Vision-Language Models at NVIDIA: Advancing Multimodal Agents and Physical AI.

We will meet online, on the 23rd of July at 20:00. The talk will also be recorded and uploaded to NLP-IL's YouTube channel. The talk will be conducted in Hebrew.

This session explores the real-world impact of vision-language models (VLMs) as they move beyond the research lab, showcasing their contributions to multimodal agentic workflows and the rapidly advancing field of Physical AI and robotics. We will explore NVIDIA’s latest solutions across these domains, through the lens of its full-stack approach for intelligent systems across diverse environments. The discussion will incorporate insights from relevant research papers and practical industrial perspectives.
Special attention will be given to Vision-Language-Action (VLA) models - a new class of foundation models designed to integrate perception, natural language understanding, and physical actions. We will review NVIDIA’s most recent research in this space, including the Gr00t N1 model.

Lior Cohen is a senior GenAI solution architect at NVIDIA, specializing in large multimodal models and AI agents. Lior completed her master’s degree studies at Ben Gurion University. She is also a member of the leading team of NLP IL community.

### Call for Speakers:
Are you working on something exciting in the field of NLP or Vision-Language and eager to share it with the community? We’re looking for future speakers! Apply here to give a lecture at one of our upcoming meetups: bit.ly/nlp_il_talk

Photo of The Israeli Natural Language Processing Meetup (NLP IL) group
The Israeli Natural Language Processing Meetup (NLP IL)
See more events
FREE