Yes, we’re fast. In fact, the fastest! 🚀🚀
— SambaNova (@SambaNovaAI) 1 octobre 2024
SambaNova Cloud delivers the fastest inference on @AIatMeta's Llama 3.2 1B and 3B — all running at full-precision.
✅ 2470 tokens per sec on 1B
✅ 1566 tokens per sec on 3B#LLM #AI Start developing ⤵️
Yes, we’re fast. In fact, the fastest! SambaNova Cloud delivers the fastest inference on @AIatMeta
's Llama 3.2 1B and 3B — all running at full-precision. 2470 tokens per sec on 1B 1566 tokens per sec on 3B #LLM #AI Start developing
