AI Dynamics

Global AI News Aggregator

Linear and LayerNorm Biases Appear Useless in Experiments

I'll give it a shot! Btw it is biases in both Linear and LayerNorm that appear to be useless (from my admittedly smaller scale experiments).

→ View original post on X — @karpathy
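
For a sense of scale, here is a rough sketch of how few parameters those biases account for. It is not taken from the original post. It assumes a GPT-2-style transformer block (a fused query/key/value projection, an attention output projection, a two-layer MLP with 4x expansion, and two LayerNorms), and the function name count_block_params and the width of 768 are illustrative choices only.

```python
def count_block_params(d_model: int, mlp_ratio: int = 4) -> dict:
    """Count weight and bias parameters in one assumed GPT-2-style block."""
    d_ff = mlp_ratio * d_model

    # (in_features, out_features) for each Linear layer in the assumed block.
    linears = [
        (d_model, 3 * d_model),  # fused query/key/value projection
        (d_model, d_model),      # attention output projection
        (d_model, d_ff),         # MLP up-projection
        (d_ff, d_model),         # MLP down-projection
    ]
    num_layernorms = 2  # assumed: one before attention, one before the MLP

    linear_weights = sum(n_in * n_out for n_in, n_out in linears)
    linear_biases = sum(n_out for _, n_out in linears)  # one bias per output unit
    ln_weights = num_layernorms * d_model  # LayerNorm scale vectors
    ln_biases = num_layernorms * d_model   # LayerNorm shift vectors

    bias = linear_biases + ln_biases
    total = linear_weights + ln_weights + bias
    return {"bias": bias, "total": total, "bias_fraction": bias / total}


counts = count_block_params(768)
print(counts)
```

Under these assumptions the biases make up well under one percent of a block's parameters. That says nothing by itself about whether they help training, which is the empirical question the post is about.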