AI Dynamics

Global AI News Aggregator

About

Cerebras and Groq achieve impressive 2000 tokens per second speed

2000 token / second running Llama 3.1 70b. Thats insane! I have high hopes for Cerebras and Groq. Especially when reasoning models like o1 take much longer to "think".

→ View original post on X — @kimmonismus