arXivMaoyang Xiang, Bo Wang, Tao LuoMon, May 25, 2026, 10:52 AM PDT
score 16.5
Quantization method speeds up AI models on edge devices
Original: OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization
Source: arxiv.org ↗
Writing ELI5 summary…