⚒️ Can LMs (especially reasoning models) effectively self-refine their responses when prompted to do so? In our new challenging benchmark, RefineBench, we revisit this question and show that the answer is still "no"-but there is a nuance! 🤗 huggingface.co/papers/2511.2…
Can Language Models Effectively Self-Refine Their Responses?
By
–

Leave a Reply