Skip to content

Details

Welcome to AI Build & Learn a weekly AI engineering stream where we pick a new topic and learn by building together.

​​​​This event is about evaluating LLM and RAG applications with Ragas, an open-source Python toolkit for objective, automated evals instead of vibes-based testing.

​We'll explore Ragas metrics for retrieval and generation quality, automated test data generation, and how to wire evals into a feedback loop so production data drives continuous improvement.

​Some things to look up to get started:

​​​Resources

​​In this stream

  • Intro to topic
  • ​​​​Community Discussion
  • Practical examples

​​​Community challenge (optional)
​​​Try spending 30–90 minutes during the week learning or building something related to the topic, then share what you’re working on in Slack.

​​​Note on Flyte / Union
​​​You may see Flyte used in some demos. Flyte is an open-source AI orchestration platform maintained by Union (where I work) for building scalable, durable, and observable AI workflows. You do not need to use Flyte to participate.

​​​Drop a comment with ideas for future topics (agents, RAG, MLOps, robotics, frameworks, and more).

Related topics

Artificial Intelligence
Artificial Intelligence Programming
Machine Learning
Python
Software Development

You may also like