← back
arXivYaocheng Zhang, Jiajun Chai, Songjun Tu, Yuqian Fu, Xiaohan Wang, Wei Lin, Guojun Yin, Qichao Zhang, Yuanheng Zhu, Dongbin ZhaoFri, May 29, 2026, 9:12 AM PDT
score 14.7

Shorter rollouts make AI training cheaper without losing quality

Original: Are Full Rollouts Necessary for On-Policy Distillation?

Source: arxiv.org

Writing ELI5 summary…