x.comRich Barton-CooperThu, Jun 4, 2026, 9:44 AM PDT
score 15.6
5likes
Smaller monitored models catch deceptive AI better than large ones
Original: Great work on building better trusted monitors from the latest @MariusHobbhahn x @MATSprogram team! 👁️
Source: x.com ↗
Writing ELI5 summary…