AI Dynamics

Global AI News Aggregator

About

Google Cloud MultiSlice Scales ML Training Across TPU Pods

The @googlecloud MultiSlice system (see blog post in tweet below) is an external version of some internal software+hardware we've been using for a while to scale some of our largest ML training runs across multiple TPUv4 and TPUv5e pods. Scaling made easy!

→ View original post on X — @jeffdean