x.comSander Dieleman @ ICML 2026π°π·Sat, Jul 4, 2026, 9:38 AM PDT
score 16.9
227likes19RT3reply
Bug in original AI scaling laws caused wasted compute on oversized models
Original: Here's a cool piece of LLM lore: the original scaling laws were wrong due to a bug, which probably led to a lot of wasted compute on oversized undertrained models π«£ (and that was before we even start
Source: x.com β
Writing ELI5 summaryβ¦