AI Dynamics

Global AI News Aggregator

About

RL ranking automation needs new human responses

The problem is the process ultimately depends on RL ranking generations made for the same prompt. If you need new human responses too, it can’t be automated.

→ View original post on X — @goodside