Controlling LLMs' Thinking Effort: Reasoning Control Fields (RCF) for Long CoT

This paper tackles underthinking (too shallow) and overthinking (too verbose) in long chain-of-thought (L-CoT) reasoning by introducing Reasoning Control Fields (RCF), a test-time method.
Reasoning Control Fields Optimize Long Chain-of-Thought in LLMs
