Huawei is gearing up to release Pangu Ultra—a 135B dense Transformer trained on 13.2T tokens with 8,192 Ascend NPUs. Despite fewer parameters, it matches DeepSeek-R1 in performance.
No full tech report or model yet. Check it out: https://
arxiv.org/abs/2504.07866
Huawei Pangu Ultra 135B Matches DeepSeek-R1 Performance
By
–
Leave a Reply