arXiv | Zijian Liu | Mon, May 18, 2026, 10:30 AM PDT
score 12.4
1 citation

AdaGrad optimizer provably works with messy, extreme gradient noise

Original: Can Adaptive Gradient Methods Converge under Heavy-Tailed Noise? A Case Study of AdaGrad

Source: arxiv.org
