x.com · Rich Barton-Cooper · Thu, Jun 4, 2026, 9:44 AM PDT
score 15.6
5 likes

Smaller monitored models catch deceptive AI better than large ones

Original: Great work on building better trusted monitors from the latest @MariusHobbhahn x @MATSprogram team! 👁️

Source: x.com
