As a general rule: if a model returns a response instantly, it's not doing test-time search — it's doing <10 forward passes. If it takes 5+ min to return something… it's doing test-time search. Thousands of iterations. If it takes 2+ hours… yep, definitely search.
Test-Time Search: Model Response Time Indicates Computational Complexity
By
–
Leave a Reply