AI Dynamics

Global AI News Aggregator

About

RLHF Trade-offs: Control vs Capability in Advanced LLMs

Many possible reasons. I’d speculate that they want something more controlled because they have a lot to lose being in the #1 spot. RLHF can help capabilities if done right, but GPT-4 is just way overdone.

→ View original post on X — @mattshumer_