arXiv
Jeremias Bohn, Tizian Dippold, Mahdi Koubaa, Elias R. Wahl, Georg Groh
Wed, Jul 1, 2026, 9:13 AM PDT
score 17.1
Logarithmic quantization makes language models run faster on consumer GPUs
Original: $\text{Log}_\text{b}$Quant: Quantizing Language Models in Logarithmic Space
Source: arxiv.org