Hiera represents a significant advance over the vision transformer architecture. It outperforms the state of the art while running up to 3.6x faster across a range of image and video tasks, without using domain-specialized modules. Code: https://bit.ly/45OSyhq
Hiera: Advanced Vision Transformer Architecture with 3.6x Speed Improvement