← back
arXivTuc Nguyen, Thai LeSat, Jun 6, 2026, 10:01 PM PDT
score 16.0

Nonlinear steering improves control over AI language model behavior

Original: Beyond Linear Activation Steering: Invertible Latent Transformations for Controlling LLM Behavior

Source: arxiv.org

Writing ELI5 summary…