AI Dynamics

Global AI News Aggregator

Groq Serves LLaMA 3 at Record 800 Tokens Per Second

My mind is blown. @GroqInc is serving LLaMA 3 at over 800 tokens per second! 800. Tokens. Per. Second. This unlocks so many incredible use-cases. It's one thing to see my demo — it's another thing entirely to experience it for yourself. Do yourself a favor and try it asap.

→ View original post on X — @mattshumer_,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *