arXivCharles Pert, Dalal Alrajeh, Alessandra RussoMon, May 25, 2026, 10:02 AM PDT
score 16.5
New neural network design handles longer sequences without retraining
Original: Length Generalization with Log-Depth Recurrent Units
Source: arxiv.org ↗
Writing ELI5 summary…