AI Dynamics

Global AI News Aggregator

Chinese AI Models Optimized for Cheaper GPU Hardware

Side effect of blocking Chinese firms from buying the best NVIDIA cards: top models are now explicitly being trained to work well on older/cheaper GPUs. The new SoTA model from @Kimi_Moonshot uses plain old BF16 ops (after dequant from INT4); no need for expensive FP4 support.

→ View original post on X — @jeremyphoward
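
To make the "BF16 ops after dequant from INT4" remark concrete, here is a minimal sketch of weight-only quantization in NumPy. It is not the actual scheme used by the model mentioned in the post: the symmetric per-group scaling, the `GROUP_SIZE` value, and all function names are assumptions chosen for illustration. NumPy has no native BF16 type, so `to_bf16` emulates it by truncating float32 values to their top 16 bits, whereas real hardware typically rounds to nearest.

```python
import numpy as np

GROUP_SIZE = 32  # hypothetical group size, chosen only for illustration


def quantize_int4(w, group_size=GROUP_SIZE):
    """Symmetric per-group quantization into the INT4 range [-8, 7].

    Values are stored in int8 for simplicity; a real kernel would pack
    two 4-bit values per byte.
    """
    groups = w.reshape(-1, group_size)
    max_abs = np.abs(groups).max(axis=1, keepdims=True)
    # Avoid division by zero for all-zero groups.
    scale = np.where(max_abs == 0, 1.0, max_abs / 7.0).astype(np.float32)
    q = np.clip(np.round(groups / scale), -8, 7).astype(np.int8)
    return q, scale


def dequantize(q, scale, shape):
    """Recover approximate float weights from INT4 codes and group scales."""
    return (q.astype(np.float32) * scale).reshape(shape)


def to_bf16(x):
    """Emulate BF16 by zeroing the low 16 bits of each float32 value."""
    bits = np.asarray(x, dtype=np.float32).view(np.uint32)
    return (bits & np.uint32(0xFFFF0000)).view(np.float32)


def linear_int4(x, q, scale, w_shape):
    """Dequantize INT4 weights to (emulated) BF16, then do an ordinary matmul.

    No FP4 or INT4 arithmetic is needed: the low-bit format is used only
    for storage, and the multiply runs on BF16-valued operands.
    """
    w = to_bf16(dequantize(q, scale, w_shape))
    return to_bf16(x) @ w
```

For a weight matrix `w` of shape `(64, 32)`, `q, scale = quantize_int4(w)` followed by `linear_int4(x, q, scale, w.shape)` returns an output of shape `(batch, 32)`. The point of the design is that the 4-bit format only reduces memory and bandwidth, while the arithmetic itself stays in a format that older GPUs already support.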