Pretty cool idea! Great to see Flan-T5 (despite being the smallest model here) hold it's ground pretty well . It even outperforms other LMs like Dolly or StableLM. Also another noteworthy point is that at "compute-match", Flan-T5 3B is equivalent to the cost of a 1.5B
Flan-T5 Performance Analysis: Efficiency Across Language Models
By
–
Leave a Reply