← back
arXivLiyun Zhang, Jiayi GuoMon, May 25, 2026, 8:57 AM PDT
score 16.4

LLMs handle meaning changes worse than formatting changes

Original: When Do LLM Agents Treat Surface Noise Differently from Semantic Noise? A 68-Cell Measurement Study with a Held-Out Trace-Level Validation

Source: arxiv.org

Writing ELI5 summary…