RETRO (DeepMind, 2021) is a beautiful idea, one badly in need of revisiting. The central innovation of RETRO is to have a small model decide what token to predict next, but outsource all knowledge to a large offline datastore. This has the added benefit of allowing you to insert
RETRO: Revisiting DeepMind’s Knowledge Outsourcing Architecture