AI Dynamics

Global AI News Aggregator

About

Activation Compression for Model Parallel Training

(9/12) Does compressing activations help model parallel training?
Authors: @bian_song
, @DachengLi177
, Hongyi Wang, @ericxing
, Shivaram Venkataraman

→ View original post on X — @cohere