x.com · swyx · Tue, Jun 2, 2026, 11:33 PM PDT
score 16.5
39 likes · 2 RT · 12 replies
Simple penalty method improves AI reasoning efficiency
Original: probably the best reward function for reasoning efficiency i've seen
Source: x.com