arXiv
Jingyi Sun, Qianli Wang, Pepa Atanasova, Nils Feldhus, Isabelle Augenstein
Sun, May 24, 2026, 2:16 AM PDT
score 16.1

When AI explanations claim honesty, are they really truthful?

Original: Investigating the Interplay between Contextual and Parametric Chain-of-Thought Faithfulness under Optimization

Source: arxiv.org
