Yes, we're fast. In fact, the fastest!
— SambaNova (@SambaNovaAI) October 1, 2024
SambaNova Cloud delivers the fastest inference on @AIatMeta's Llama 3.2 1B and 3B, all running at full-precision.
2470 tokens per sec on 1B
1566 tokens per sec on 3B
#LLM #AI Start developing