AI Dynamics

Global AI News Aggregator

Scaling LLM Training: Double-Digit Batch Size for 70B Models

Double-digit batch size; the eventual goal is one rack for a 70B model.

Source: original post on X by @cerebras