100 Tokens per second per user on #Llama2 from @MetaAI! This ultra-low latency performance could have a massive impact on workloads using #LLMs for everyone from artists to analysts, programmers to educators, all #GenAI and beyond. Book your demo to learn more: contact@groq.com pic.twitter.com/VWcSqPRm18
— Groq Inc (@GroqInc) 8 août 2023
100 Tokens per second per user on #Llama2 from @MetaAI
! This ultra-low latency performance could have a massive impact on workloads using #LLMs for everyone from artists to analysts, programmers to educators, all #GenAI and beyond. Book your demo to learn more: contact@groq.com