arXivTuc Nguyen, Thai LeSat, Jun 6, 2026, 10:01 PM PDT
score 16.0
Nonlinear steering improves control over AI language model behavior
Original: Beyond Linear Activation Steering: Invertible Latent Transformations for Controlling LLM Behavior
Source: arxiv.org ↗
Writing ELI5 summary…