arXiv · Walter Nelson, Theofanis Karaletsos, Francesco Locatello · Fri, May 29, 2026, 5:44 AM PDT

Making sparse autoencoders more stable and interpretable

Original: Toward Identifiable Sparse Autoencoders

Source: arxiv.org
