*One might argue that modifying prompts (eg classic "think step by step") counts as inference-time scaling due to the extra output tokens. So in that sense they use inference-time scaling.
I should have been more clear: I meant specifically sampling techniques + using a verifier.
Inference-Time Scaling: Prompt Modification vs Sampling and Verification
By
–
Leave a Reply