Making Agent Evals Isn’t As Hard As You Think!

Discussing the theory behind creating and using agent evals Resources: Evals Field Guide - https://lucek.ai/blogs/agent-evaluations Evaluation Concepts - https://docs.langchain.com/langsmith/... Demystifying Evals - https://www.anthropic.com/engineering... Chapters: 00:00 - Introduction 00:33 - Context 02:37 - What get’s measured 05:08 - How its measured 08:20 - Unit Test Evals 11:14 - Agent Integration Evals 14:49 - Online Evals 18:32 - Benchmark Evals 23:51 - Agent Eval Loop #ai #programming #datascience