x.comwill brownMon, May 18, 2026, 1:07 PM PDT
score 17.0
508likes30RT9reply
Novel training approach helps AI agents learn during reinforcement learning
Original: god what a beautiful objective. i wonder how general you can push this. best non-distillation answer ive seen for knowledge acq during RL, feels bitter-pilled in a way that most self-teaching methods
Source: x.com ↗
Writing ELI5 summary…