x.com · Shenyang Deng ✈️ ICML2026 · Sat, May 30, 2026, 7:53 PM PDT
score 15.1
42 likes · 1 RT
HTMuon improves training efficiency for large language models
Original: Great work with Tianyu! This is one of the earliest papers to apply Σᵖ to Muon.
Source: x.com ↗