Review past evaluations — agent reasoning, events, scores, and results are all persisted.
Loading pipeline history…