arXiv
Long Phan, Devin Kim, Alexander Pan, Alice Blair, Adam Khoja, Dan Hendrycks
Thu, May 21, 2026, 10:32 AM PDT

Training method reduces hidden political bias in AI language models

Original: Reducing Political Manipulation with Consistency Training

Source: arxiv.org
