arXiv
Muyu Pan, Shu Zhao, Nan Zhang, Philip Shin, Varun Parekh, Vijaykrishnan Narayanan, Rui Zhang
Mon, May 25, 2026, 6:42 AM PDT

Training AI models to refuse uncertain questions instead of guessing

Original: TIAR: Trajectory-Informed Advantage Reweighting for LLM Abstention Learning

Source: arxiv.org
