← back
arXivRashid MushkaniSat, May 30, 2026, 12:56 PM PDT
score 15.6

Urban AI models need reliability checks, not just accuracy scores

Original: Benchmarks for Vision-Language Models in Urban Perception Should Be Reliability-Aware and Negotiated

Source: arxiv.org

Writing ELI5 summary…