x.com · Hamel Husain · Fri, May 22, 2026, 10:24 AM PDT
score 16.4
149 likes · 18 RT · 12 replies

AI evaluation tools are still too immature for real work

Original: The experiments conducted in this post illustrate how early we are as an industry on eval tooling.

Source: x.com
