One more question @balazskegl
: if the LLM was trained to optimise a reward, then it could be argued that it has agency. This is what happens in LLM post training. I guess the counter argument would be that such rewards are external. Yet, the Schrodinger argument is that the
LLM Agency and Reward Optimization: External vs Internal Motivation
By
–