AI Dynamics

Global AI News Aggregator

Doubling Model Parameters Without Increased Compute Cost

~2x the parameters for the same compute cost. Basically free model sparsity (sparse w.r.t. the enc/dec blocks).
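One way to read this claim, sketched as back-of-the-envelope arithmetic: in an encoder-decoder model, input tokens pass only through the encoder stack and target tokens only through the decoder stack, so each token touches roughly half the weights. The sketch below is an illustration under stated assumptions, not the post author's own calculation. It counts only the dense matmul weights per block and ignores embeddings, cross-attention, and the sequence-length-dependent attention term; all function names and the example sizes are hypothetical.

```python
# Rough parameter/FLOP comparison (assumption-laden sketch, not an exact count).
# Ignored on purpose: embeddings, cross-attention, layer norms, biases,
# and the quadratic attention-score term.

def block_params(d_model: int, d_ff: int) -> int:
    """Approximate weights in one Transformer block."""
    attn = 4 * d_model * d_model   # Q, K, V and output projections
    ffn = 2 * d_model * d_ff       # up- and down-projection
    return attn + ffn

def matmul_flops(params: int, tokens: int) -> int:
    """Forward-pass FLOPs: about 2 per weight per token (multiply + add)."""
    return 2 * params * tokens

def decoder_only(n_layers: int, d_model: int, d_ff: int, n_in: int, n_out: int):
    """Every token (input and target) passes through all n_layers blocks."""
    params = n_layers * block_params(d_model, d_ff)
    flops = matmul_flops(params, n_in + n_out)
    return params, flops

def encoder_decoder(n_layers: int, d_model: int, d_ff: int, n_in: int, n_out: int):
    """n_layers encoder blocks see only inputs; n_layers decoder blocks see only targets."""
    stack = n_layers * block_params(d_model, d_ff)
    params = 2 * stack
    flops = matmul_flops(stack, n_in) + matmul_flops(stack, n_out)
    return params, flops

if __name__ == "__main__":
    # Hypothetical sizes chosen only for illustration.
    dec_params, dec_flops = decoder_only(12, 768, 3072, n_in=512, n_out=128)
    ed_params, ed_flops = encoder_decoder(12, 768, 3072, n_in=512, n_out=128)
    print("parameter ratio (enc-dec / dec-only):", ed_params / dec_params)
    print("FLOP ratio (enc-dec / dec-only):", ed_flops / dec_flops)
```

Under these simplifications the encoder-decoder has twice the weights at the same matmul cost; in a real model the decoder's cross-attention adds some parameters and compute, so the match is approximate.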

→ View original post on X — @yitayml