Yeah, for some applications it may be sufficient. FNet doesn't outperform a contemporary attention-based architecture, though.
FNet performance limitations compared to attention-based architectures