SmolLM for the win. A cool write-up by the Jina team on training long-context models at around 0.5-1.5B parameters.
SmolLM: Training Long Context Models at 0.5-1.5B Parameters
Global AI News Aggregator