We trained an AI using process supervision (rewarding the thought process rather than the outcome) to achieve a new state of the art in mathematical reasoning. An encouraging sign for alignment of advanced AIs: …
https://openai.com/research/improving-mathematical-reasoning-with-process-supervision
Process Supervision Achieves State-of-the-Art Mathematical Reasoning