← back
arXivVincent Limbach, Jonas Dornbusch, David Lüdke, Stephan Günnemann, Leo SchwinnTue, Jun 2, 2026, 6:39 AM PDT
score 17.1

New attack method breaks AI safety defenses more reliably

Original: Black-box, Adaptive, Efficient, Transferable, Harmful, Applicable... Attacks Are All You Need to Break LLMs

Source: arxiv.org

Writing ELI5 summary…