arXiv
Haoqi Wang, Lorenz K. Mueller, Jiawei Zhuang, Mathieu Salzmann, Lukas Cavigelli
Fri, Jun 5, 2026, 3:11 AM PDT
New technique reduces accuracy loss when compressing large language models
Original: OffQ: Taming Structured Outliers in LLM Quantization by Offsetting
Source: arxiv.org