Tag: observability

All the articles with the tag "observability".

Breaking Down Agent Evals (Part 1): A Practitioner's Guide

Part 1 of a 3-part series. Why traces (not code) are the source of truth in agents, the three observability primitives, run types, the metrics that matter at each level, the pass^k reliability metric, a five-step methodology for building an eval suite, and a filter funnel approach to why no single eval method is enough.

Published: 10 Feb, 2026
· agents / evals / observability