x.com · alphaXiv · Sun, May 17, 2026, 12:00 PM PDT
score 16.4
138 likes · 28 RT · 3 replies
Pruning giant AI models beats training small ones from scratch
Original: “SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training”
Source: x.com