arXivJun Wang, Xiaohao Xu, Xiaonan HuangFri, May 29, 2026, 5:04 AM PDT
score 15.3
Vision-language models struggle to detect robot collisions safely
Original: Probing Collision Grounding in Vision-Language Models for Safe Human-Robot Collaboration
Source: arxiv.org ↗
Writing ELI5 summary…