AI Dynamics

Global AI News Aggregator

About

Hardware limitations on sparse neural activation in LLMs

The human brain is incredibly efficient because it only activates the specific neurons needed for a thought. Modern LLMs naturally try to do this too (> 95% of neurons in feedforward layers stay silent for any given word), but our hardware punishes them for it. One of the most

→ View original post on X — @hardmaru