x.com | Ramp Labs | Tue, Jun 30, 2026, 3:13 PM PDT
score 16.6
55 likes, 3 RT, 4 replies

New AI model runs more tests and costs more

Original: We ran Sonnet 5 in Ramp SWE-Bench, and observed that compared to its predecessor it:

Source: x.com
