Notably, both papers present negative results, showing that many efficient Transformers will provably struggle with reasoning tasks unless the number of parameters is significantly increased.
Efficient Transformers Struggle with Reasoning Tasks Without More Parameters
By
–
Leave a Reply