By this, I mean something like, instead of creating an environment for the model to use during training, prompt a LLM to output what it thinks the environment would output for any given tool call.
LLM Simulated Environments for Agent Training Without Real Tools
By
–