AI Dynamics

Global AI News Aggregator

Qwen3-Next-80B-A3B: Efficient MoE Model with Superior Performance

Qwen3-Next-80B-A3B is out: 80B parameters, but only 3B activated per token, giving 10x cheaper training and 10x faster inference than Qwen3-32B (especially at 32K+ context). Qwen3-Next-80B-A3B-Instruct approaches our 235B flagship. Qwen3-Next-80B-A3B-Thinking outperforms…

→ View original post on X (@_akhaliq)
