arXivXiongwei Zhu, Xiaojian Liao, Tianyang Jiang, Yusen Zhang, Liang Wang, Limin XiaoTue, May 26, 2026, 7:32 AM PDT
score 16.4
Smarter routing speeds up sparse AI models on memory-limited devices
Original: ReMoE: Boosting Expert Reuse through Router Fine-Tuning in Memory-Constrained MoE LLM Inference
Source: arxiv.org ↗
Writing ELI5 summary…