x.comSebastian RaschkaThu, May 21, 2026, 4:10 PM PDT
score 17.0
76likes15RT11reply
Gated DeltaNet-2 improves linear attention with separate memory controls
Original: Gated DeltaNet has been one of my favorite "hybrid attention" newcomers in the good old transformer stack.
Source: magazine.sebastianraschka.com ↗
Writing ELI5 summary…