AI Dynamics

Global AI News Aggregator

About

MiniMax-M2 Agent-Native RL Training Paper Released

MiniMax-M2 paper just dropped The key focus of M2 is on something more agent-native. It trains on runnable workspaces and artifact-grounded rewards, then uses Forge to scale RL over long coding, app, search, and office-task trajectories. What's interesting is that M2.7

→ View original post on X — @askalphaxiv,