x.comCarlos E. PerezSat, May 23, 2026, 4:00 AM PDT
score 25.1
32likes8RT1reply
Testing AI Output, Not Building It, Is the Real Bottleneck
Original: Why Evaluation Is the Bottleneck: A Structural Account of Human Judgment in Production AI
Source: x.com ↗
Writing ELI5 summary…