AI Dynamics

Global AI News Aggregator

Cerebras and Groq achieve impressive 2000 tokens per second speed

2000 token / second running Llama 3.1 70b. Thats insane! I have high hopes for Cerebras and Groq. Especially when reasoning models like o1 take much longer to "think".

→ View original post on X — @kimmonismus,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *