My notes on Kimi-K2-Instruct-0905, aka Kimi K-2.1 – an incremental improvement on Moonshot's previous trillion parameter open weights model, now with twice the context length (256k up from 128k)
Kimi K-2.1: Moonshot’s Upgraded Open Weights Model
By
–