2). MiniMax-01 – introduces a new series of models that integrate Mixture-of-Experts; introduces a model with 32 experts and 456B parameters, and 45.9B are activated for each token…
MiniMax-01 Introduces 456B Parameter Model with Mixture-of-Experts
By
–
