arXivLi Jiang, Haoran Xu, Yichuan Ding, Amy ZhangSat, Jun 6, 2026, 8:17 PM PDT
score 15.9
Fixing language model training by correcting wrong reasoning paths
Original: Trajectory-Refined Distillation
Source: arxiv.org ↗
Writing ELI5 summary…
Original: Trajectory-Refined Distillation
Source: arxiv.org ↗
Writing ELI5 summary…