AI Dynamics

Global AI News Aggregator

Residual Stream Projections and Information Preservation in Neural Networks

Of course it has access: the projection from each block into the residual stream can be learned to be zero, so any information that is needed is preserved.
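The point can be illustrated with a minimal sketch, assuming a toy residual block of the form x + W_out · relu(W_in · x). The block structure, the dimensions, and names such as `residual_block`, `W_in`, and `W_out` are hypothetical choices for illustration, not taken from the original post. When the output projection is zero, the block adds nothing and the stream passes through unchanged.

```python
def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def relu(v):
    """Elementwise ReLU."""
    return [max(0.0, a) for a in v]


def residual_block(x, W_in, W_out):
    """Toy block (an assumption): returns x + W_out @ relu(W_in @ x)."""
    hidden = relu(matvec(W_in, x))
    update = matvec(W_out, hidden)  # what the block writes into the stream
    return [xi + ui for xi, ui in zip(x, update)]


# Hypothetical 3-dimensional residual stream and 2-dimensional hidden layer.
x = [1.0, -2.0, 0.5]
W_in = [[0.3, -0.1, 0.8], [1.2, 0.4, -0.5]]

# Case 1: the output projection is all zeros, so the block writes nothing.
W_out_zero = [[0.0, 0.0] for _ in range(3)]
y_zero = residual_block(x, W_in, W_out_zero)

# Case 2: only the rows for the first two coordinates are zero, so those
# coordinates are preserved while the third is overwritten.
W_out_partial = [[0.0, 0.0], [0.0, 0.0], [1.0, 1.0]]
y_partial = residual_block(x, W_in, W_out_partial)

print(y_zero == x)
print(y_partial[:2] == x[:2], y_partial[2] != x[2])
```

In the second case the zero rows act as a per-coordinate version of the same idea: the block can leave part of the stream untouched while still writing to the rest.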

→ View original post on X (@karpathy)
