arXivAreeb Gani, Asal Meskin, Gabrielle Kaili-May Liu, Arman CohanTue, Jun 2, 2026, 10:53 AM PDT

score 16.5

Measuring how well AI reasoning models express their confidence

Original: Quantifying Faithful Confidence Expression in Large Reasoning Models

Writing ELI5 summary…