AI Dynamics

Global AI News Aggregator

RETRO: Revisiting DeepMind’s Knowledge Outsourcing Architecture

RETRO (DeepMind, 2021) is a beautiful idea, one badly in need of revisiting the central innovation of retro is to have a small model decide what token to predict next, but outsource all knowledge to a large offline datastore this has the added benefit of allowing you to insert

→ View original post on X — @jxmnop,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *