AI Dynamics

Global AI News Aggregator

About

Tencent releases Hunyuan-Large: open MoE model beats LLaMA 3.1-405B

> Hunyuan-Large just released by @TencentGlobal : Largest ever open MoE LLM, only 52B active parameters but beats LLaMA 3.1-405B on most academic benchmarks! Key insights: Mixture of Experts (MoE) architecture: 389 B parameters in total, but only 52B are activated for any

→ View original post on X — @aymericroucher