arXivMengze Hong, Xia Zeng, Zeyang Lei, Sheng Wang, Chen Jason Zhang, Di Jiang, Taiming Fu, Jinfeng Huang, Mengqiao Liu, Qinghe Chang, Haosheng Zou, Qiongyi Zhou, Sijun He, Chen Xiaoshuai, Simon Deng, Haojing Huang, Zijian Li, Lucas Mu Li, Fubao Zhang, Mona Zhou, Wei Ma, Chenxuan Ma, Yuanmeng Zhang, Jian Song, Minlong Peng, Di Liang, Davey ChenMon, Jun 8, 2026, 7:44 AM PDT
score 17.1
New benchmark measures how well AI assistants satisfy real users
Original: UXBench: Benchmarking User Experience in AI Assistants
Source: arxiv.org ↗
Writing ELI5 summary…