AI Dynamics

Global AI News Aggregator

Building Self-Learning Web Search Agents with RLHF Fine-Tuning

Start w few tools to figure out what works/doesn’t. Use that knowledge to build a web search agent that can figure it out on its own. Use this to build an auto-updating db of all possible tools w documentation. Then fine tune a model w RLHF to decide which tool to use.

→ View original post on X — @yoheinakajima,

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *