Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution
paper page: https://huggingface.co/papers/2307.06304
… The ubiquitous and demonstrably suboptimal choice of resizing images to a fixed resolution before processing them with computer vision models has not yet been
NaViT: Vision Transformer for Flexible Aspect Ratios and Resolutions
