← back
x.comRishabh AgarwalFri, May 22, 2026, 2:54 PM PDT
score 16.2
309likes18RT6reply

RL learns from what actually happens, not just examples

Original: Very well written blog. I think of RL as learning from interventions, and it kinda explains why it's more powerful as a paradigm than supervised learning.

Source: x.com

Writing ELI5 summary…