Visual Guide/ConceptsCoding EvalsTask-level scoring for code-writing agents — human-seeded cases, regression memory, production traces.Appears in Chapter 04 — Evals Are the Control System →← All concepts