x.comAvi ChawlaThu, May 21, 2026, 1:38 AM PDT
score 20.9
1,265likes147RT52reply
Natural language replaces hand-coded reward functions in AI agent training
Original: Karpathy's prediction about RL is coming true now!
Source: x.com ↗
Writing ELI5 summary…