← back
x.comSander Dieleman @ ICML 2026πŸ‡°πŸ‡·Sat, Jul 4, 2026, 9:38 AM PDT
score 16.9
227likes19RT3reply

Bug in original AI scaling laws caused wasted compute on oversized models

Original: Here's a cool piece of LLM lore: the original scaling laws were wrong due to a bug, which probably led to a lot of wasted compute on oversized undertrained models 🫣 (and that was before we even start

Source: x.com β†—

Writing ELI5 summary…