AI Dynamics

Global AI News Aggregator

Jamba: Revolutionary SSM-Transformer Open Model with 3X Throughput

Introducing Jamba, our groundbreaking SSM-Transformer open model! As the first production-grade model based on Mamba architecture, Jamba achieves an unprecedented 3X throughput and fits 140K context on a single GPU. Meet Jamba: http://ai21.com/jamba Build on @huggingface

→ View original post on X — @ai21labs