arXiv | Nianyi Lin, Jiajie Zhang, Lei Hou, Juanzi Li | Fri, May 29, 2026, 10:51 AM PDT
score 14.7

Training AI to reason better through long documents

Original: LongTraceRL: Learning Long-Context Reasoning from Search Agent Trajectories with Rubric Rewards

Source: arxiv.org
