Mixture-of-Supernets: Improving Weight-Sharing Supernet Training with Architecture-Routed Mixture-of-Experts paper page: https://huggingface.co/papers/2306.04845
… Weight-sharing supernet has become a vital component for performance estimation in the state-of-the-art (SOTA) neural architecture
