PlanBench-XL: Evaluation of Long-Horizon Planning of Tool-Using LLM Agents in Large-Scale Tool Ecosystems