AI Dynamics

Global AI News Aggregator

About

Building a Transformer with simplified layers in nanoGPT

Yep, that's the one! (as @Thom_Wolf linked earlier too). I'd expect it's possible to build a Transformer with that kind of layer alone, would look much more pleasing. Will see if I can prototype in nanoGPT.

→ View original post on X — @karpathy,