AI Dynamics

Global AI News Aggregator

SmolLM: Training Long Context Models at 0.5-1.5B Parameters

SmolLM for the win. A cool write-up by the Jina team on training long context models at around 0.5-1.5B parameters.

→ View original post on X — @thom_wolf