Great collab with @SakanaAILabs on an #ICML26 paper about sparse transformer kernels + formats optimized for modern NVIDIA GPU execution.
• TwELL sparse packing
• Fused CUDA kernels
• 20%+ inference/training speedups at scale
Paper + code below
New ICML26 Paper on Sparse Transformer Kernels for NVIDIA GPUs
