There is an alternate reality where Cray took their vector supercomputers, ditched FP64 calculations, and went with one FP32 pipe and a BF16 tensor core pipe. The same instruction set, memory architecture, and vector registers would have made a sweet deep learning machine, in
Cray Vector Supercomputers Alternative Design for Deep Learning
By
–
Leave a Reply