A simple post while you are easing back in after the holiday break The fastest #LLM #Inference speed is available to try right now at http://
Groq.com running #Llama 2, 70B with 4k sequence length. No "tricks" for our speed, we're an LPU™ based system. Try it out.
Groq Offers Fastest LLM Inference Speed with Llama 2 70B
By
–
Leave a Reply