Evaluating multi-agent systems when ground truth is incomplete
Three eval modes — output, trajectory, and side effect — and the practical pattern we use when there's no single right answer.
#agents
#eval
#production-ml
Field notes on applied AI/ML. What actually ships vs. what demos.