x.com · Hamel Husain · Fri, May 22, 2026, 10:24 AM PDT
score 16.4
149 likes · 18 RT · 12 replies
AI evaluation tools are still too immature for real work
Original: The experiments conducted in this post illustrate how early we are as an industry on eval tooling.
Source: x.com