arXivNithin Somasekharan, Youssef Hassan, Shiyao Lin, Gihan Panapitiya, Patrick Emami, Anurag Acharya, Sameera Horawalavithana, Shaowu PanMon, May 18, 2026, 9:34 AM PDT
score 16.4
New benchmark tests AI assistants at clarifying vague science problems
Original: SCICONVBENCH: Benchmarking LLMs on Multi-Turn Clarification for Task Formulation in Computational Science
Source: arxiv.org ↗
Writing ELI5 summary…