AI Dynamics

Global AI News Aggregator

Normalization Placement in Transformer Architectures: Text vs Vision

Thanks! The relative position of normalization is one of the few things that has changed since the original transformer architecture, and it's not entirely clear where it should be placed. Most transformers for text use post-norm (normalization after the residual addition), whereas vision transformers tend to use pre-norm.

→ View original post on X — @jeande_d
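The contrast in the post comes down to where LayerNorm sits relative to the residual connection. A minimal NumPy sketch of the two placements, with a hypothetical `sublayer` standing in for the attention or MLP sub-block (assumption: names and shapes here are illustrative, not from any particular library):

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Normalize the last axis to zero mean and unit variance.
    mu = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mu) / np.sqrt(var + eps)

def post_norm_block(x, sublayer):
    # Original Transformer placement: normalize AFTER the residual add.
    return layer_norm(x + sublayer(x))

def pre_norm_block(x, sublayer):
    # ViT-style placement: normalize BEFORE the sublayer;
    # the residual path itself stays unnormalized.
    return x + sublayer(layer_norm(x))

rng = np.random.default_rng(0)
x = rng.normal(size=(2, 4, 8))          # (batch, tokens, features)
sublayer = lambda h: np.maximum(h, 0.0)  # stand-in for attention/MLP

post = post_norm_block(x, sublayer)
pre = pre_norm_block(x, sublayer)

# Post-norm forces every block output back to zero mean / unit variance;
# pre-norm leaves the residual stream free to grow.
print(np.allclose(post.mean(axis=-1), 0.0, atol=1e-5))
```

One practical consequence of this difference: because pre-norm keeps an identity path from input to output, gradients flow through the residual stream unimpeded, which is often cited as the reason pre-norm models train more stably without learning-rate warmup.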
