← back
arXivElia Cunegatti, Marcus Vukojevic, Erik Nielsen, Giovanni IaccaMon, Jun 1, 2026, 10:52 AM PDT
score 16.6

Smarter compression removes parts of language models surgically

Original: From Layers to Submodules: Rethinking Granularity in Replacement-Based LLM Compression

Source: arxiv.org

Writing ELI5 summary…