x.comwill brownWed, Jun 3, 2026, 10:54 PM PDT
score 16.4
210likes8RT3reply
New benchmark tests AI on real-world physical engineering tasks
Original: wow this is like a whole framework for creating verifiable engineering (atoms not bits) environments
Source: x.com ↗
Writing ELI5 summary…