← back
arXivSaket Reddy, Ke Yang, ChengXiang ZhaiWed, Jun 3, 2026, 5:31 AM PDT
score 17.1

Stabilizing bias removal in language models through group-relative training

Original: BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization

Source: arxiv.org

Writing ELI5 summary…