arXiv
Xin Yan, Aqiang Wang, Zhenglin Wan, Xingrui Yu and Ivor Tsang
Wed, Jun 3, 2026, 7:34 AM PDT
score 16.6
Method compresses diffusion language models to run faster and cheaper
Original: STaR-Quant: State-Time Consistent Post-Training Quantization for Diffusion Large Language Models
Source: arxiv.org