AI Dynamics

Global AI News Aggregator

About

Process Reward Models Grade Robot Performance Like Sports Replay

What if we could audit a robot's performance like a sports replay, grading every move, not just the final score? Researchers from Peking University, Chinese Academy of Sciences, and the Beijing Academy of AI present PRM-as-a-Judge. They use a "Process Reward Model" to watch a

→ View original post on X — @jiqizhixin