← back
x.comXiuyu LiThu, May 28, 2026, 3:02 AM PDT
score 16.3
18likes6RT1reply

Using model internals to pick better training data for AI

Original: SAERL uses Sparse Autoencoders (the mechanistic interpretability tool) to drive LLM post-training data engineering from MODEL INTERNALS rather than external heuristics.

Source: x.com

Writing ELI5 summary…