arXivJiahao Ying, Boxian Ai, Wei Tang, Siyuan Liu, Yixin CaoFri, May 22, 2026, 7:09 AM PDT
score 14.6
New benchmark tests whether AI agent skills actually work
Original: OpenSkillEval: Automatically Auditing the Open Skill Ecosystem for LLM Agents
Source: arxiv.org ↗
Writing ELI5 summary…