arXivSridhar MahadevanTue, May 26, 2026, 9:36 AM PDT
score 16.4
New math framework unifies Transformer attention variants
Original: Kan Extension Transformers: A Categorical Unification of Attention, Diffusion, and Predict-Detach Self-Conditioning
Source: arxiv.org ↗
Writing ELI5 summary…