← back
arXivMaoyang Xiang, Bo Wang, Tao LuoMon, May 25, 2026, 10:52 AM PDT
score 16.5

Quantization method speeds up AI models on edge devices

Original: OrpQuant: Geometric Orthogonal Residual Projection for Multiplier-Free Power-of-Two Transformer Quantization

Source: arxiv.org

Writing ELI5 summary…