AI Dynamics

Global AI News Aggregator

XAttention Framework Accelerates Attention Computation 13.5×

A team led by @songhan_mit just released XAttention, a plug-and-play framework that accelerates attention computation by up to 13.5×!

→ View original post on X — @jiqizhixin