AI Dynamics

Global AI News Aggregator

About

Preconditioned DeltaNet Adds Curvature-Aware Linear Recurrence Modeling

“Preconditioned DeltaNet: Curvature-aware Sequence Modeling for Linear Recurrences” This paper views linear recurrences through a least-squares/test-time regression lens, and adds the missing curvature information via preconditioning. Main idea: precondition the delta-rule

→ View original post on X — @askalphaxiv,