arXivZukang Xu, Xing Hu, Dawei YangTue, May 19, 2026, 2:05 AM PDT
score 17.0
New technique boosts accuracy of ultra-compressed AI model inference
Original: TORQ: Two-Level Orthogonal Rotation for MXFP4 Quantization
Source: arxiv.org ↗
Writing ELI5 summary…