x.com · Sebastian Raschka · Thu, May 21, 2026, 4:10 PM PDT
score 17.0
76 likes · 15 RT · 11 replies

Gated DeltaNet-2 improves linear attention with separate memory controls

Original: Gated DeltaNet has been one of my favorite "hybrid attention" newcomers in the good old transformer stack.

Source: magazine.sebastianraschka.com
