x.comNathan LambertWed, Jun 3, 2026, 7:39 PM PDT
score 15.9
270likes20RT3reply
On-policy distillation improves AI model training efficiency
Original: Great little video on modern on-policy distillation in post-training recipes.
Source: x.com ↗
Writing ELI5 summary…