arXiv | Charles Pert, Dalal Alrajeh, Alessandra Russo | Mon, May 25, 2026, 10:02 AM PDT
score 16.5

New neural network design handles longer sequences without retraining

Original: Length Generalization with Log-Depth Recurrent Units

Source: arxiv.org
