I was using Opus via Cursor and ran an audit with Gemini 3.1 Pro, Opus 4.6, and GPT-5.4. Then I asked Opus to assess the quality of each audit (anonymously). And I think it 100% nailed the current state of the models: Gemini 3.1 Pro: The weakest. Looked at the screen. Found the
Opus Audit Assessment of Leading AI Models Compared