Name: AI Agent Evaluation: Practical guide to benchmark and improve AI Agents
Start: 2025-09-16T19:00:00+04:00
End: 2025-09-16T21:00:00+04:00

Happening Tomorrow!

AI Agent Evaluation: Practical Guide to Benchmark & Improve AI Agents
📅 September 16 \| 🕢 7:30 PM GST
📌[https://nas.io/artificialintelligence/events/ai-agent-evaluation](https://nas.io/artificialintelligence/events/ai-agent-evaluation)

AI agents look magical in demos—but often fail in the real world, eroding trust (remember Air Canada’s chatbot error or Google Bard’s costly slip?).
This talk introduces a practical playbook for evaluating agents, covering:
✅ Frameworks like RAGAS & TruLens
✅ Fresh ideas like Evaluation-Driven Development
✅ The path to an AI Quality Movement—where agents aren’t just impressive, but truly reliable.

Mohammad Arshad

Patricia Mari

DubAI and Data Professional

Technology

Professional Development

Innovation

Education & Technology

Courses and Workshops

Big Data

Predictive Analytics

Data Science

Machine Learning

Data Analytics

AI Agent Evaluation: Practical guide to benchmark and improve AI Agents

Share this event

AI Agent Evaluation: Practical guide to benchmark and improve AI Agents

Details