← back
arXivYi Zhao, Siqi Wang, Zhe Hu, Yushi Li, Jing LiFri, May 29, 2026, 7:28 AM PDT
score 14.6

Benchmark tests whether AI can judge systems helping blind users

Original: A Visually Impaired Assistance Benchmark for VLM-as-a-Judge Evaluation

Source: arxiv.org

Writing ELI5 summary…