AI Dynamics

Global AI News Aggregator

About

Jamba 1.5 Models: Hybrid SSM-Transformer Architecture Innovation

The Jamba 1.5 models are based on our novel hybrid SSM-Transformer architecture, which combines the quality, speed and efficiency of both. Jamba 1.5 Mini has 12B active/52B total parameters, while Large is 94B active/398B total – the largest Mamba model ever made. [2/6]

→ View original post on X — @ai21labs