arXivHaoran Zhao, Soyeon Caren Han, Eduard HovySat, Jun 6, 2026, 6:11 PM PDT
score 15.8
Multimodal AI models hide internal instability despite correct answers
Original: When Correct Decisions Hide Internal Stress: Decision-State Probing in Multimodal Language Models
Source: arxiv.org ↗
Writing ELI5 summary…