AI Dynamics

Global AI News Aggregator

LLM Agency and Reward Optimization: External vs Internal Motivation

One more question @balazskegl: if the LLM was trained to optimise a reward, then it could be argued that it has agency. This is what happens in LLM post training. I guess the counter argument would be that such rewards are external. Yet, the Schrodinger argument is that the

→ View original post on X — @nandodf