x.comIgor FomenkoSat, May 30, 2026, 2:13 AM PDT
score 15.6
1reply
Making AI benchmarks harder without accidentally adding bias
Original: @cwolferesearch Interesting point, I’ve found the hardest part is defining harder without bias.
Source: x.com ↗
Writing ELI5 summary…