arXiv
Walter Nelson, Theofanis Karaletsos, Francesco Locatello
Fri, May 29, 2026, 5:44 AM PDT
Making sparse autoencoders more stable and interpretable
Original: Toward Identifiable Sparse Autoencoders
Source: arxiv.org