arXivZehua Cheng, Wei Dai, Jiahao SunWed, Jun 3, 2026, 8:48 AM PDT
score 16.6
Training method helps AI reasoning survive out-of-distribution examples
Original: Invariant Gradient Alignment for Robust Reasoning Distillation
Source: arxiv.org ↗
Writing ELI5 summary…