We know that Jamba 1.5 models are the fastest, but the question is – how fast? @ArtificialAnlys tested our models to find out The image below shows the throughput for various models (with prompt length = 10K tokens). Jamba 1.5 models are a whole lot faster – and that speed
Jamba 1.5 Models Demonstrate Significantly Superior Throughput Performance
By
–
