AI Dynamics

Global AI News Aggregator

Per-layer embeddings likely unused in final model implementation

I saw the per-layer embeddings in the code, but I don't think they were used in the final models. They may be a leftover from internal experiments.

→ View original post on X, by @rasbt
