x.com · Zhu Liang · Sat, Jul 4, 2026, 10:43 AM PDT
score 16.5
110 likes · 4 RT · 35 replies
Exploring agents as evaluators instead of LLMs for AI testing
Original: what if instead of llm as judge, we use agent as judge, for evals?
Source: x.com ↗