I've decided to start a new blog series about model architectures in the era of LLMs. Here's part 1, covering broader architectures such as Transformer encoders, encoder-decoders, and PrefixLM, along with denoising objectives. A frequently asked question: "The people who worked on language and NLP
Model Architectures in the LLM Era: Transformers and Beyond