Name: LLM Multi-agent System: A real-world use case study for clinical simulation
Start: 2025-05-16T14:00:00-05:00
End: 2025-05-16T15:00:00-05:00

[AgentClinic](https://arxiv.org/pdf/2405.07960): A multimodal agent benchmark to evaluate AI in simulated clinical environments.

Evaluating large language models (LLM) in clinical scenarios is crucial to assessing their potential clinical utility. Existing benchmarks rely heavily on static question-answering, which does not accurately depict the complex, sequential nature of clinical decision-making. Here, we introduce AgentClinic, a multimodal agent benchmark for evaluating LLMs in simulated clinical environments that include patient interactions, multimodal data collection under incomplete information, and the usage of various tools, resulting in an in-depth evaluation across nine medical specialties and seven languages.

Slides for past meetups posted: [Github](https://github.com/YanXuHappygela/LLM-reading-group/)
Recordings have been posted at: [YanAITalk](https://www.youtube.com/@yanaitalk/videos)

Feel free to reach out if you want to present a paper or a use case at upcoming meetups!

**Note:** You must have a Zoom account to login (free account is sufficient). Link and password will be shared three days before the meeting.

Yan Xu

Houston Machine Learning

Technology

Artificial Intelligence

Big Data

Machine Learning

Deep Learning

Data Visualization

Predictive Analytics

Data Science

LLM Multi-agent System: A real-world use case study for clinical simulation

Online event

Share this event

LLM Multi-agent System: A real-world use case study for clinical simulation

Details