Name: [RG] Interactive Debugging and Steering of Multi-Agent AI Systems
Start: 2025-06-22T17:15:00Z
End: 2025-06-22T18:15:00Z

Our host, [Dr Abdoullahi Diasse](https://www.linkedin.com/in/abdoullahi-diasse-phd-3514617a/), Senior AI Engineer at Syrate Sarl, will present the paper [Vision as LoRA](https://arxiv.org/abs/2503.20680), whose abstract follows:

We introduce Vision as LoRA (VoRA), a novel paradigm for transforming an LLM into an MLLM. Unlike prevalent MLLM architectures that rely on external vision modules for vision encoding, VoRA internalizes visual capabilities by integrating vision-specific LoRA layers directly into the LLM. This design allows the added parameters to be seamlessly merged into the LLM during inference, eliminating structural complexity and minimizing computational overhead. Moreover, inheriting the LLM's ability of handling flexible context, VoRA can process inputs at arbitrary resolutions.
To further strengthen VoRA's visual capabilities, we introduce a block-wise distillation method that transfers visual priors from a pre-trained ViT into the LoRA layers, effectively accelerating training by injecting visual knowledge. Additionally, we apply bi-directional attention masks to better capture the context information of an image. We successfully demonstrate that with additional pre-training data, VoRA can perform comparably with conventional encode-based MLLMs.

Galsen AI

Ismaila SECK

Online event
