Common Q: Can you train language model w diffusion?
Favorite A: read this post (the whole blog is excellent) (Roughly speaking state of the art generative AI is either trained autoregressively or with diffusion. The underlying neural net usually a Transformer.)
Language Models Training: Autoregressive vs Diffusion Approaches
By
–
Leave a Reply