← back
arXivKaiwen Xue, Tao Wei, Guoxin Zhang, Zhonghong Ou, Kaoyan Lu, Yu Feng, Yifan Zhu, Haoran LuoFri, May 29, 2026, 5:49 AM PDT
score 15.4

Benchmark for AI agents navigating and locating themselves in real world

Original: ERGeoBench:A Comprehensive Benchmark for Embodied Reasoning and Geo-localization in Multimodal Large Language Models

Source: arxiv.org

Writing ELI5 summary…