x.com · will brown · Wed, Jun 3, 2026, 10:54 PM PDT
score 16.4
210 likes · 8 RT · 3 replies

New benchmark tests AI on real-world physical engineering tasks

Original: wow this is like a whole framework for creating verifiable engineering (atoms not bits) environments

Source: x.com
