The key distinction: More test-time compute is not automatically useful. It becomes useful when the model has learned an internal landscape where extra iterations move the latent state toward solution-aligned attractors rather than spurious ones. That is why the convergence