arXivChinh Hoang, Mohammad Rashedul HasanWed, May 27, 2026, 10:38 AM PDT
score 16.5
Vision-language models sound smart but often miss real causality
Original: The Abstraction Gap in Vision-Language Causal Reasoning
Source: arxiv.org ↗
Writing ELI5 summary…