Start w few tools to figure out what works/doesn’t. Use that knowledge to build a web search agent that can figure it out on its own. Use this to build an auto-updating db of all possible tools w documentation. Then fine tune a model w RLHF to decide which tool to use.
Building Self-Learning Web Search Agents with RLHF Fine-Tuning
By
–
Leave a Reply