arXivYi Zhao, Siqi Wang, Zhe Hu, Yushi Li, Jing LiFri, May 29, 2026, 7:28 AM PDT
score 14.6
Benchmark tests whether AI can judge systems helping blind users
Original: A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation
Source: arxiv.org ↗
Writing ELI5 summary…