DeepSeek COOKED! It is the only open-weight (commercially permissive) model in the top 10. What's more, it was post-trained on *only* ~5K H800 GPU hours (about $0.01M, i.e. roughly $10K). Now imagine what a better post-trained model can do!
DeepSeek Breaks Into Top 10 With Minimal Post-Training Cost
