arXiv | Zukang Xu, Xing Hu, Dawei Yang | Tue, May 19, 2026, 2:05 AM PDT

New technique boosts accuracy of ultra-compressed AI model inference

Original: TORQ: Two-Level Orthogonal Rotation for MXFP4 Quantization

Source: arxiv.org
