← back
arXivBohan Liu, Wenqian Ye, Guangzhi Xiong, Zhenghao He, Sanchit Sinha, Aidong ZhangThu, Jul 2, 2026, 10:55 AM PDT
score 17.1

Fixing CLIP's vulnerability to text in images without retraining

Original: Towards Robustness against Typographic Attack with Training-free Concept Localization

Source: arxiv.org

Writing ELI5 summary…