x.com | Xiuyu Li | Thu, May 28, 2026, 3:02 AM PDT
score 16.3
18 likes, 6 RT, 1 reply
Using model internals to pick better training data for AI
Original: SAERL uses Sparse Autoencoders (the mechanistic interpretability tool) to drive LLM post-training data engineering from MODEL INTERNALS rather than external heuristics.
Source: x.com