x.com · Shenyang Deng ✈️ ICML2026 · Sat, May 30, 2026, 7:53 PM PDT
score 15.1
42 likes · 1 RT

HTMuon improves training efficiency for large language models

Original: Great work with Tianyu! This is one of the earliest papers to apply Σᵖ to Muon.

Source: x.com
