RL and inference-compute scaling potential for model improvement
I wouldn't discount it quite yet. I am curious how it'll do with some additional RL + inference-compute scaling à la o1.