AI Dynamics

Global AI News Aggregator

Reasoning in Latent States and Tokens for Transformers

What if you taught transformers to reason in both latent states and tokens? This Microsoft paper adds a self-supervised objective, predicting the next latent state, to standard next-token training, where a lightweight dynamic model…
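
As a rough illustration of the idea summarized above, the sketch below combines a next-token loss with a next-latent-state prediction loss. The summary does not give the paper's actual formulation, so everything here is an assumption made for illustration: the mean-squared-error latent loss, the weighting factor `lam`, the single linear map standing in for the "lightweight dynamic model", and all function names.

```python
import math

def cross_entropy(logits, target):
    """Next-token loss: negative log-softmax probability of the target index."""
    m = max(logits)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]

def latent_mse(predicted, actual):
    """Next-latent loss (assumed form): mean squared error between two vectors."""
    return sum((p - a) ** 2 for p, a in zip(predicted, actual)) / len(actual)

def linear_dynamics(hidden, weights):
    """Hypothetical lightweight dynamics model: one linear map from h_t to a guess at h_{t+1}."""
    return [sum(w * h for w, h in zip(row, hidden)) for row in weights]

def combined_loss(logits, target, hidden_t, hidden_next, weights, lam=0.5):
    """Token loss plus lam times the latent-prediction loss (lam is an assumed weight)."""
    token_loss = cross_entropy(logits, target)
    latent_loss = latent_mse(linear_dynamics(hidden_t, weights), hidden_next)
    return token_loss + lam * latent_loss

# Toy example: 2-token vocabulary, 2-dimensional latent state, identity dynamics.
identity = [[1.0, 0.0], [0.0, 1.0]]
loss = combined_loss([0.0, 0.0], 0, [1.0, 2.0], [2.0, 2.0], identity)
print(loss)
```

In this toy setup the token term and the latent term are simply added together. A real training loop would compute both from the transformer's own hidden states using an autodiff framework, and the paper may weight or structure the two terms differently.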

→ View original post on X — @askalphaxiv