Compute is all you need.
For a given amount of compute, ViT and ConvNets perform the same. Quote from this DeepMind article: "Although the success of ViTs in computer vision is extremely impressive, in our view there is no strong evidence to suggest that pre-trained ViTs
ViT and ConvNets Achieve Equal Performance at Same Compute
By
–
Leave a Reply