Skip to content

Details

Speaker: Alekhya Guduri

Topic: Crafting Evals : Signals v/s noise

How to rigorously evaluate LLM-driven systems using structured frameworks like LLM-as-a-judge, golden datasets, and multi-dimensional quality metrics, while distinguishing true product improvements from noise in AI and ranking experiments.

Agenda:
🕛 4:00–4:05 — Welcome
🕛 4:05–4:55 — Alekhya Guduri - Crafting Evals : Signals v/s noise
🕛 4:55–5:00 — Closing

🎙️ Speaker Opportunities at Our Meetups

Have expertise in QA, leadership, or tech skills you'd love to share? We're always looking for engaging speakers for our monthly meetups. This is a fantastic platform to showcase your knowledge, connect with the community, and inspire others. If you're interested submit your topic here —we’d love to hear from you!

Related topics

AI and Society
Agile Testing
Software Engineering
Software QA and Testing
Test Automation

You may also like