“Reinforcement learning is about understanding your world, whereas LLMs are about mimicking people, doing what people say you should do. They’re not about figuring out what to do.”
– Richard Sutton https://youtube.com/watch?v=21EYKqUsPfg&t=159s
…
Reinforcement Learning vs LLMs: Understanding vs Mimicking