9). Adapting while Learning – proposes a two-part fine-tuning approach that first helps LLMs learn from tool-generated solutions and then trains them to determine when to solve problems directly versus when to use tools; testing on math, climate science, and epidemiology
Fine-tuning LLMs to learn from tools and adaptive problem-solving
By
–