AI Dynamics

Global AI News Aggregator

MiniMax-01 Introduces a 456B-Parameter Mixture-of-Experts Model

MiniMax-01 introduces a new series of models built on a Mixture-of-Experts architecture. The model has 32 experts and 456B total parameters, of which 45.9B are activated for each token…

→ View original post on X — @dair_ai
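
To put the quoted figures in perspective, the short sketch below works out what share of the model's parameters is active per token. It uses only the two numbers stated in the summary (456B total, 45.9B activated); the constant and function names are illustrative and not taken from the MiniMax-01 release, and the calculation says nothing about how experts are routed.

```python
# Figures quoted in the summary, in billions of parameters.
TOTAL_PARAMS_B = 456.0
ACTIVE_PARAMS_B = 45.9


def active_fraction(active_b: float, total_b: float) -> float:
    """Return the share of parameters used per token (hypothetical helper)."""
    if total_b <= 0:
        raise ValueError("total parameter count must be positive")
    return active_b / total_b


fraction = active_fraction(ACTIVE_PARAMS_B, TOTAL_PARAMS_B)
inactive_b = TOTAL_PARAMS_B - ACTIVE_PARAMS_B

print(f"Active per token: {fraction:.1%} of all parameters")
print(f"Idle per token:   {inactive_b:.1f}B parameters")
```

Roughly one tenth of the parameters take part in processing any given token, which is the usual appeal of a Mixture-of-Experts design: total capacity grows with the number of experts while per-token compute tracks only the activated subset.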