Key takeaway: finetuning Flan-T5 is better and more compute-efficient than finetuning T5. In other words, Flan-T5 > T5 for every real scenario I can think of. Don't use the pre-trained checkpoint—always finetune!
Flan-T5 Outperforms T5 in Finetuning Efficiency