← back
arXivHaoqi Wang, Lorenz K. Mueller, Jiawei Zhuang, Mathieu Salzmann, Lukas CavigelliFri, Jun 5, 2026, 3:11 AM PDT
score 15.3

New technique reduces accuracy loss when compressing large language models

Original: OffQ: Taming Structured Outliers in LLM Quantization by Offsetting

Source: arxiv.org

Writing ELI5 summary…