AI Dynamics

Global AI News Aggregator

Training Models to Generate Gist Tokens for Intermediate Reasoning

yeah this is a very related idea: dense vectors convey more information than discrete tokens, in less space! but it's not the same thing. i'm describing how to train a model that generates gist tokens on the fly for its intermediate reasoning steps

→ View original post on X — @jxmnop
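
As a rough illustration of the "more information, in less space" point, here is a back-of-the-envelope sketch comparing loose capacity upper bounds for one discrete token and one dense vector. It is not from the original post: the vocabulary size, hidden dimension, and 16-bit storage are assumed values, and the helper names `token_bits` and `vector_bits` are hypothetical. These are raw storage bounds only, not a measure of how much usable information a trained model actually packs into a gist token.

```python
import math


def token_bits(vocab_size: int) -> float:
    """Upper bound on bits carried by one discrete token from a vocabulary."""
    return math.log2(vocab_size)


def vector_bits(dim: int, bits_per_component: int) -> int:
    """Upper bound on bits in the raw storage of one dense vector."""
    return dim * bits_per_component


# All three values below are assumptions chosen for illustration.
vocab_size = 50_000        # assumed vocabulary size
hidden_dim = 4096          # assumed hidden dimension
bits_per_component = 16    # assumed 16-bit float storage

per_token = token_bits(vocab_size)
per_vector = vector_bits(hidden_dim, bits_per_component)
ratio = per_vector / per_token

print(f"one discrete token: at most {per_token:.2f} bits")
print(f"one dense vector:   at most {per_vector} bits")
print(f"capacity ratio (upper bounds only): {ratio:.0f}x")
```

Under these assumptions, a single sequence position holding a dense vector has far more raw capacity than a position holding one discrete token, which is the intuition behind letting a model write vectors instead of text for its intermediate steps.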