Possibly GPT-5 being tested in ChatGPT https://t.co/gb4UUQXjXh
Peter Gostev (@petergostev), 28 July 2025

Impact of Kimi K2 and Qwen 3 Coder on the LLM market, based on the @openrouter data in the 'programming' category. What we see is quite interesting:
– Sonnet 4 models keep growing as if nothing happened
– Gemini 2.5 Pro is losing share very quickly, from 15% to 9% in a
Are you getting Zenith now? Or is it an older one?

This is interesting: in my tests, Summit and Lobster (I never got Zenith) were way better than Qwen3-Coder every single time. I expect that whatever @OpenAI model version makes it to the leaderboard will be miles above everything else. Nectarine and Starfish are around Kimi K2 level
4.5 is the only model I trust with writing, especially rewriting without changing the style – all the others don't understand the task
I'm surprised you actually managed to get good coding results; for me, the agent was reviewing the output and making the code worse
Why hasn't Microsoft trained an actually working PowerPoint and Excel agent? They have full software access, data, environments, compute – and importantly, unlike the SF tech companies, they realise how important PowerPoint and Excel actually are

As we get ready for GPT-5, it's useful to look back at how often labs featured in the Top 5 of @lmarena_ai over the last 1.5 years. The competition is primarily between OpenAI and Google. Average appearances overall and specifically in 2025:
– OpenAI: Overall: 2.2; in 2025: 1.7
It feels like Zenith might be the creative / normal part of GPT-5 and Summit is where the coding tasks would be routed. I haven't come across Zenith in any coding tasks, but in regular questions it tends to come up once in a while
Could GPT-5 be claimed to be 'AGI'? The funny thing about the OpenAI 'AGI clause' with Microsoft is that OpenAI needs to show it has developed 'systems' that have the 'capability' to generate $100bn in profits, not to actually generate the $100bn. I am not saying they will do