Are you seeing this with the new GPT-5 Thinking model that came out a few weeks ago? That's why I wrote about this now: I think it may have tipped over from completely untrustworthy to often (maybe even usually) correct.
GPT-5 Thinking Model Shows Significant Reliability Improvement