Why has scaling Diffusion Transformers with Mixture-of-Experts been so tricky for visual data? Researchers from Fudan University, Alibaba Group's Tongyi Lab, Zhejiang University, The University of Hong Kong, and MMLab tackle this question head-on. They introduce ProMoE, an MoE framework for scaling Diffusion Transformers on visual data.
ProMoE: Scaling Diffusion Transformers with MoE for Visual Data