arXivLiyun Zhang, Jiayi GuoMon, May 25, 2026, 8:57 AM PDT
score 16.4
LLMs handle meaning changes worse than formatting changes
Original: When Do LLM Agents Treat Surface Noise Differently from Semantic Noise? A 68-Cell Measurement Study with a Held-Out Trace-Level Validation
Source: arxiv.org ↗
Writing ELI5 summary…