← back
arXivNithin Somasekharan, Youssef Hassan, Shiyao Lin, Gihan Panapitiya, Patrick Emami, Anurag Acharya, Sameera Horawalavithana, Shaowu PanMon, May 18, 2026, 9:34 AM PDT
score 16.4

New benchmark tests AI assistants at clarifying vague science problems

Original: SCICONVBENCH: Benchmarking LLMs on Multi-Turn Clarification for Task Formulation in Computational Science

Source: arxiv.org

Writing ELI5 summary…