(5/n) MediSwift was trained in three sizes (Med, Large, and XL), with dense and sparse variants, on a Cerebras Wafer-Scale Cluster with only a few configuration changes. Cerebras makes it easy to experiment and train production-ready models. Contact us to learn how we can help
MediSwift Trained on Cerebras Wafer-Scale Cluster
By
–
Leave a Reply