> Hunyuan-Large just released by @TencentGlobal : Largest ever open MoE LLM, only 52B active parameters but beats LLaMA 3.1-405B on most academic benchmarks! Key insights: Mixture of Experts (MoE) architecture: 389 B parameters in total, but only 52B are activated for any
Tencent releases Hunyuan-Large: open MoE model beats LLaMA 3.1-405B
By
–
