arXivXiaolin Liu, Yilun Zhu, Xiangyu Zhao, Xuehui Wang, Yan Li, Xin Li, Haoyu Cao, Xing Sun, Shaofeng Zhang, Xu Yang, Zhihang Zhong, Xue YangMon, Jun 1, 2026, 10:32 AM PDT
score 16.6
Video AI models struggle to catch brief visual moments
Original: Moment-Video: Diagnosing Temporal Fidelity of Video MLLMs on Momentary Visual Events
Source: arxiv.org ↗
Writing ELI5 summary…