← back
x.comNathan LambertWed, Jun 3, 2026, 7:39 PM PDT
score 15.9
270likes20RT3reply

On-policy distillation improves AI model training efficiency

Original: Great little video on modern on-policy distillation in post-training recipes.

Source: x.com

Writing ELI5 summary…