(9/12) Does compressing activations help model parallel training?
Authors: @bian_song, @DachengLi177, Hongyi Wang, @ericxing, Shivaram Venkataraman
Activation Compression for Model Parallel Training