AI Dynamics

Global AI News Aggregator

About

Next-token prediction compresses latent structure into understanding

A model trained for next-token prediction is forced to build compressed representations of latent structure in text. Ilya Sutskever correctly refers to this phenomenon as understanding. Here, a model trained for next-step sensor prediction, with a robot that has proprioception

→ View original post on X — @nandodf