I learned about this trick from the paper "Reversible Vision Transformers" by @Karttikeya_m et al. (https://arxiv.org/abs/2302.04869). I'm not an expert here, but I think the reversible transformation is derived from previous work on NICE (https://arxiv.org/abs/1410.8516). 6/6
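
For readers who haven't seen the idea, here is a minimal sketch of an additive, two-stream reversible block in the spirit of NICE-style coupling. The functions F and G are hypothetical stand-ins for arbitrary sub-blocks (say, attention and an MLP), and the names forward and inverse are mine, not taken from either paper. The point is only that the inputs can be recomputed exactly from the outputs, so intermediate activations would not need to be stored.

```python
import numpy as np

def F(x):
    # Hypothetical sub-block; it does not need to be invertible itself.
    return np.tanh(x)

def G(x):
    # Second hypothetical sub-block, also arbitrary.
    return 0.5 * x ** 2

def forward(x1, x2):
    # Each stream is updated by adding a function of the other stream.
    y1 = x1 + F(x2)
    y2 = x2 + G(y1)
    return y1, y2

def inverse(y1, y2):
    # Undo the two additions in reverse order to recover the inputs.
    x2 = y2 - G(y1)
    x1 = y1 - F(x2)
    return x1, x2

rng = np.random.default_rng(0)
x1, x2 = rng.normal(size=4), rng.normal(size=4)
y1, y2 = forward(x1, x2)
r1, r2 = inverse(y1, y2)
print(np.allclose(x1, r1), np.allclose(x2, r2))
```

Because inverse only subtracts what forward added, the reconstruction holds for any choice of F and G, up to floating-point rounding.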
Reversible Vision Transformers: Memory-Efficient Deep Learning Architecture