← back
arXivJun Wang, Xiaohao Xu, Xiaonan HuangFri, May 29, 2026, 5:04 AM PDT
score 15.3

Vision-language models struggle to detect robot collisions safely

Original: Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration

Source: arxiv.org

Writing ELI5 summary…