Evaluating AI Agents
In this course, you’ll build an AI agent, add observability to visualize and debug its steps, and evaluate its performance component-wise. In detail, you’ll: Distinguish between evaluating LLM-based systems and traditional software testing. Explore the basic structure of AI agents – routers, skills, and memory – and implement an AI agent from scratch. Add observability…







