arXivNianyi Lin, Jiajie Zhang, Lei Hou, Juanzi LiFri, May 29, 2026, 10:51 AM PDT
score 14.7
Training AI to reason better through long documents
Original: LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards
Source: arxiv.org ↗
Writing ELI5 summary…