What if LLMs could read a million tokens without exploding in cost? Subquadratic, an AI research startup, just unveiled SubQ — the first model built on fully subquadratic sparse attention (SSA). Instead of comparing every token pair, SSA routes attention only to the truly
Subquadratic Unveils New SubQ Model Using Subquadratic Sparse Attention
By
–
