arXivSaket Reddy, Ke Yang, ChengXiang ZhaiWed, Jun 3, 2026, 5:31 AM PDT
score 17.1
Stabilizing bias removal in language models through group-relative training
Original: BiasGRPO: Stabilizing Bias Mitigation in High-Variance Reward Landscapes via Group-Relative Policy Optimization
Source: arxiv.org ↗
Writing ELI5 summary…