4). LLMs Still Can’t Plan – evaluates whether large reasoning models such as o1 can plan; finds that a domain-independent planner can solve all instances of Mystery Blocksworld but LLMs struggle, even on small instances…
Large Language Models Struggle with Domain-Independent Planning Tasks
By
–
