How do we make LLMs faster and lighter? Don’t force the GPU to adapt to sparsity. Reshape the sparsity to fit the GPU! ⚡️
— Sakana AI (@SakanaAILabs) 8 mai 2026
Excited to share our new #ICML2026 paper in collaboration with @NVIDIA: "Sparser, Faster, Lighter Transformer Language Models". This work introduces new… pic.twitter.com/ehByWHIh6I
How do we make LLMs faster and lighter? Don’t force the GPU to adapt to sparsity. Reshape the sparsity to fit the GPU! Excited to share our new #ICML2026 paper in collaboration with @NVIDIA
: "Sparser, Faster, Lighter Transformer Language Models". This work introduces new