10/ Learning at Test Time – proposes new sequence modeling layers with linear complexity and an expressive hidden state; defines a hidden state as an ML model itself capable of updating even on test sequence; by a linear model and a two-layer MLP based hidden state is found to
Learning at Test Time: Linear Complexity Sequence Modeling Innovation
By
–