← back
arXivYuanqing Cai, Ziyi Huang, Minhao Liu, Lixin Duan, Wen Li, Yanru ZhangTue, May 19, 2026, 10:15 AM PDT
score 16.5

LLMs fail to notice hidden clues even in explicit instructions

Original: MixRea: Benchmarking Explicit-Implicit Reasoning in Large Language Models

Source: arxiv.org

Writing ELI5 summary…