AI Dynamics

Global AI News Aggregator

Huawei Pangu Ultra 135B Matches DeepSeek-R1 Performance

Huawei is gearing up to release Pangu Ultra—a 135B dense Transformer trained on 13.2T tokens with 8,192 Ascend NPUs. Despite fewer parameters, it matches DeepSeek-R1 in performance.
No full tech report or model yet. Check it out: https://
arxiv.org/abs/2504.07866

→ View original post on X — @jiqizhixin,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *