New benchmark tests AI agents across diverse real-world tasks

Original: Great contribution to this field by adding richer domains and skills to agentic evals curated by experts @harvey

Writing ELI5 summary…