Name: AI Agent Evaluation: Practical guide to benchmark and improve AI Agents
Start: 2025-09-16T19:30:00+04:00
End: 2025-09-16T21:30:00+04:00

Happening Tomorrow!

AI Agent Evaluation: Practical Guide to Benchmark & Improve AI Agents
📅 September 16 \| 🕢 7:30 PM GST
📌[https://nas.io/artificialintelligence/events/ai-agent-evaluation](https://nas.io/artificialintelligence/events/ai-agent-evaluation)

AI agents look magical in demos—but often fail in the real world, eroding trust (remember Air Canada’s chatbot error or Google Bard’s costly slip?).
This talk introduces a practical playbook for evaluating agents, covering:
✅ Frameworks like RAGAS & TruLens
✅ Fresh ideas like Evaluation-Driven Development
✅ The path to an AI Quality Movement—where agents aren’t just impressive, but truly reliable.

Mohammad Arshad

Patricia Mari

MENA AI and Data Community

Technology

Innovation

Artificial Intelligence

Machine Learning

Courses and Workshops

Big Data

Business Intelligence

Entrepreneurship

Every week on Tuesday until October 28, 2025

AI Agent Evaluation: Practical guide to benchmark and improve AI Agents

Online event

Share this event

AI Agent Evaluation: Practical guide to benchmark and improve AI Agents

Details