7. RL for Search-Efficient LLMs Proposes a new RL-based framework (SEM) that explicitly teaches LLMs when to invoke search and when to rely on internal knowledge, aiming to reduce redundant tool use while maintaining answer accuracy.
RL Framework Teaches LLMs Efficient Search Tool Usage
By
–
