arXiv · Zijian Liu · Mon, May 18, 2026, 10:30 AM PDT
score 12.4
1 cite
AdaGrad optimizer provably works with messy, extreme gradient noise
Original: Can Adaptive Gradient Methods Converge under Heavy-Tailed Noise? A Case Study of AdaGrad
Source: arxiv.org