It clearly ablates the main architectures like encoder decoder, decoder only etc. it even tried prefix lm. Also a master class in llm objective functions.
Architecture Ablation Study and LLM Objective Functions Analysis
By
–
By
–
It clearly ablates the main architectures like encoder decoder, decoder only etc. it even tried prefix lm. Also a master class in llm objective functions.