We trained 1.3B-parameter GPT-3 models with unstructured sparsity on CS-2 systems and showed that they achieve competitive results at a fraction of the inference FLOPs: our 83.8% sparse model delivered a 3x reduction in FLOPs at matching performance. Learn more here:
Sparse GPT-3 Models Achieve 3x FLOP Reduction on CS-2