AI Dynamics

Global AI News Aggregator

Language Model Training: Autoregressive vs Diffusion Approaches

Common Q: Can you train a language model with diffusion?
Favorite A: read this post (the whole blog is excellent). (Roughly speaking, state-of-the-art generative AI is trained either autoregressively or with diffusion. The underlying neural net is usually a Transformer.)
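
To make the contrast concrete, here is a minimal sketch of how training examples might be built under each approach. It is an illustration only, not the method from the linked post: the diffusion side assumes one particular discrete variant (masking tokens at a random noise level), and the names `MASK`, `autoregressive_examples`, and `masked_diffusion_example` are hypothetical.

```python
import random

MASK = "<mask>"  # hypothetical symbol standing in for the "noised" token state


def autoregressive_examples(tokens):
    """Build (prefix, next_token) pairs: each token is predicted from the ones before it."""
    return [(tokens[:i], tokens[i]) for i in range(1, len(tokens))]


def masked_diffusion_example(tokens, rng):
    """Corrupt the whole sequence at a random noise level t (assumed masking variant).

    The model would see `corrupted` and be trained to recover `targets`,
    i.e. the original tokens at the masked positions.
    """
    t = rng.random()  # noise level in [0, 1): the chance that each token is masked
    corrupted = [MASK if rng.random() < t else tok for tok in tokens]
    targets = {i: tok for i, (tok, c) in enumerate(zip(tokens, corrupted)) if c == MASK}
    return t, corrupted, targets


tokens = ["the", "cat", "sat", "down"]

# Autoregressive: one left-to-right prediction target per position.
ar_pairs = autoregressive_examples(tokens)
for prefix, target in ar_pairs:
    print(prefix, "->", target)

# Diffusion (masking variant): denoise all corrupted positions at once.
t, corrupted, targets = masked_diffusion_example(tokens, random.Random(0))
print(round(t, 2), corrupted, targets)
```

In both cases the network consuming these examples could be a Transformer; what differs is the prediction task, either the next token given a prefix or the clean tokens given a corrupted sequence.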

→ View original post on X — @karpathy
