arXivHyunjin Cho, Youngji Roh, Jaehyung KimSat, Jun 6, 2026, 8:46 AM PDT
score 15.5
Unsupervised discovery of hidden reasoning patterns in language models
Original: Shared Semantics, Divergent Mechanisms: Unsupervised Feature Discovery by Aligning Semantics and Mechanisms
Source: arxiv.org ↗
Writing ELI5 summary…