3/5 Where most other tiny models choke at context lengths above 8K, Jamba Reasoning 3B stays steady, with a consistent 30-40 tokens/second on an M3 MacBook Pro, regardless of context size. This is up to an order of magnitude faster than other on-device models.
Jamba Reasoning 3B: Exceptional Performance on Extended Contexts
By
–
Leave a Reply