There's a very cool arbitrage happening right now, with @hippoml_com, @FireworksAI_HQ, and @togethercompute, where they're writing GPU kernels to improve efficiency on hardware and workload configurations that are important but overlooked by large providers.
This is obviously a
GPU Kernel Optimization Arbitrage Among AI Infrastructure Providers