"Hyperloop Transformers" This paper propose a memory-efficient LLM via looped Transformers. They basically reuse the middle block across depth, then add hyper-connections only between loops. Key result is that this restores flexibility lost from weight sharing, letting the
Hyperloop Transformers: Memory-Efficient LLM via Looped Architecture
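To make the summary concrete, here is a minimal sketch of the idea, not the paper's implementation. It assumes a toy tanh layer in place of a real Transformer block, and it models hyper-connections as several parallel residual streams that are mixed by a small matrix between loops. The names `make_block`, `mix_streams`, and `looped_forward` are hypothetical and exist only for illustration.

```python
import math
import random


def make_block(dim, seed=0):
    """Toy stand-in for the shared middle block: x -> tanh(W x).

    Assumption: a real model would use a full Transformer layer here.
    """
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(dim)
    w = [[rng.gauss(0.0, scale) for _ in range(dim)] for _ in range(dim)]

    def block(x):
        return [math.tanh(sum(wij * xj for wij, xj in zip(row, x))) for row in w]

    return block


def mix_streams(streams, mixer):
    """Mix n residual streams with an n x n matrix (assumed form of a hyper-connection)."""
    n, dim = len(streams), len(streams[0])
    return [
        [sum(mixer[i][k] * streams[k][d] for k in range(n)) for d in range(dim)]
        for i in range(n)
    ]


def looped_forward(x, shared_block, mixers):
    """Apply one shared block len(mixers) + 1 times.

    The block weights are reused on every loop; the only loop-specific
    parameters are the small mixing matrices applied between loops.
    """
    n = len(mixers[0]) if mixers else 1
    dim = len(x)
    streams = [list(x) for _ in range(n)]
    num_loops = len(mixers) + 1
    for loop in range(num_loops):
        # Read: average the streams to form the block input.
        h = [sum(s[d] for s in streams) / n for d in range(dim)]
        y = shared_block(h)
        # Write: add the block output to every stream (residual update).
        streams = [[sd + yd for sd, yd in zip(s, y)] for s in streams]
        # Mix streams only between loops, never after the last one.
        if loop < num_loops - 1:
            streams = mix_streams(streams, mixers[loop])
    return [sum(s[d] for s in streams) / n for d in range(dim)]


if __name__ == "__main__":
    block = make_block(dim=3, seed=0)
    identity = [[1.0, 0.0], [0.0, 1.0]]
    learned = [[1.0, 0.5], [0.0, 1.0]]  # hypothetical values, not trained
    x = [0.1, -0.2, 0.3]
    print(looped_forward(x, block, [identity, identity]))
    print(looped_forward(x, block, [learned, identity]))
```

With identity mixers the model reduces to a plain looped residual network, so every loop behaves identically. Giving each loop its own mixing matrix adds only a handful of parameters per loop while letting the loops behave differently, which is the kind of flexibility the summary says weight sharing otherwise removes.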
