AI Dynamics

Global AI News Aggregator

SmolLM: Training Long Context Models at 0.5-1.5B Parameters

SmolLM for the win. Cool write-up by the Jina team on training long-context models at around 0.5-1.5B parameters.

→ View original post on X by @thom_wolf
