← back
arXivHyunjin Cho, Youngji Roh, Jaehyung KimSat, Jun 6, 2026, 8:46 AM PDT
score 15.5

Unsupervised discovery of hidden reasoning patterns in language models

Original: Shared Semantics, Divergent Mechanisms: Unsupervised Feature Discovery by Aligning Semantics and Mechanisms

Source: arxiv.org

Writing ELI5 summary…