← back
arXivJeremias Bohn, Tizian Dippold, Mahdi Koubaa, Elias R. Wahl, Georg GrohWed, Jul 1, 2026, 9:13 AM PDT
score 17.1

Logarithmic quantization makes language models run faster on consumer GPUs

Original: $\text{Log}_\text{b}$Quant: Quantizing Language Models in Logarithmic Space

Source: arxiv.org

Writing ELI5 summary…