x.comRishabh AgarwalFri, May 22, 2026, 2:54 PM PDT
score 16.2
309likes18RT6reply
RL learns from what actually happens, not just examples
Original: Very well written blog. I think of RL as learning from interventions, and it kinda explains why it's more powerful as a paradigm than supervised learning.
Source: x.com ↗
Writing ELI5 summary…