AI Dynamics

Global AI News Aggregator

About

Five Years Between Transformer Attention and FlashAttention Innovation

"5 years between Self-Attention Is All You Need and FlashAttention"
quite incredible stat, gives a pause

→ View original post on X — @karpathy