← back
arXivWanqi Yang, Yuexiao Ma, Alexander Conzelmann, Xiawu Zheng, Michael W. Mahoney, T. Konstantin Rusch, Shiwei LiuWed, Jun 3, 2026, 8:03 AM PDT
score 16.6

Compress AI models without calibration data using weight patterns

Original: AlphaQ: Calibration-Free Bit Allocation for Mixture-of-Experts Quantization

Source: arxiv.org

Writing ELI5 summary…