x.com · swyx · Tue, Jun 2, 2026, 11:33 PM PDT
score 16.5
39 likes · 2 RT · 12 replies

Simple penalty method improves AI reasoning efficiency

Original: probably the best reward function for reasoning efficiency i've seen

Source: x.com
