x.com | Anirudh Goyal | Tue, May 26, 2026, 3:05 PM PDT
score 16.1
52 likes, 6 RT, 4 replies
Evaluating AI research agents through search strategy, not just results
Original: (1/N) How should we evaluate AI agents that conduct ML research?
Source: arxiv.org