Congrats on the release! Are there plans to release the previous Flash 8B? I think it'll be a really helpful research artefact for the community!
@reach_vb
-
GPU Availability Challenges for AI Projects
By
–
I have an open thread on slack – if the GPU gods allow, then we shall!
-
OAI Spec Compliance: Drop-in Replacement Solution
By
–
Not that I’m aware of, everything is as advertised. Let me know if you encounter any edge cases. P.S. it’s OAI spec compliant so should be drop in replacement
-
Top Trending Text Generation Models on Hugging Face
By
–
This should be the most up-to-date: https://
huggingface.co/models?inferen
ce=warm&pipeline_tag=text-generation&sort=trending
… -
API Deployments and Evaluation Tools on Hub
By
–
Happy to make some API deployments on the hub if you want to run evals. Or hook you up with a Pro sub to run evals on Inference API. Let me know
-
DeepSeek-V2.5-1210 Model Weights Released on Hugging Face
By
–
Model weights: https://
huggingface.co/deepseek-ai/De
epSeek-V2.5-1210
… -
DeepSeek-V2.5-1210 Upgrade Achieves 82.8% on MATH-500
By
–
Let’s gooo! The whale is back w/ DeepSeek-V2.5-1210 an upgraded version of DeepSeek-V2.5, offering improvements in: > 74.8% to 82.8% on MATH-500
> 29.2% to 34.38% on LiveCodebench
> writing and Reasoning: notable improvements in internal tests
> optimised file upload and -
Request for Summary with Token Limit Adjustment
By
–
Thank you for the summary OmarGPT, next time, please use max_tokens = 160
-
Tencent Prepares V2.0 Release of Major Product
By
–
Congrats on the release! All eyes on Tencent to ship v2.0 of this: