x.com · Grigory Sapunov · Sun, May 17, 2026, 12:55 PM PDT
score 12.7
56 likes · 13 RT · 1 reply

Reward models learn nine times faster than world models

Original: 1/

Source: x.com
