We published new research on how we serve post-trained Qwen3 235B models on NVIDIA GB200 NVL72 Blackwell racks. GB200 is a major step up over Hopper for high-throughput inference on large MoE models, not just a training platform.
Research on Serving Qwen3 235B Models on NVIDIA GB200 Racks
By
–
