The model release cycles from Anthropic & OpenAI are genuinely insane, previously we had 6-12 months between updates, now models are released every 1.5 months. This is an under-appreciated reason why Anthropic & OpenAI are in the lead. Google's releases are not as fast,
@petergostev
-

Anthropic Acceleration: Model Release Cycle Shortening Trend
By
–

Anthropic released the next version sooner than I thought – the trend is accelerating – from 50-70 days before, down to 42 days since Opus 4.7
-
Devin AI Productivity Metrics: Calibrating Capability Claims
By
–
Explanation & chat link: "I digitized the curve from the image and used Cognition’s “>10x since start of 2026” claim to calibrate the relative shape. Then I anchored the absolute scale using the public “~1.1M PRs shipped with Devin” figure by Feb 2026: the chart area up to then
-
AI Agents as Platform Infrastructure and Service Layer
By
–
Agent is a platform could become a real thing. You will pay for your agent, then use that inference to access other services (e.g. shopping, finance, legal). You incentives are aligned (if its important, I want to spend a lot) and 3rd parties don't need to build their own agents
-
AI model quality and price evolution over time
By
–
This is elite data – how the pareto frontier moved over time. Took a lot of effort to get right. Huge shift in the model quality & price in the last 3 years. https://t.co/CtafQ4Gd66
— Peter Gostev (@petergostev) 21 mai 2026This is elite data – how the pareto frontier moved over time. Took a lot of effort to get right. Huge shift in the model quality & price in the last 3 years.
-
Gemini 3.5 Flash: strengths and inconsistencies observed
By
–
I did a video on Gemini 3.5 Flash – it is a pretty weird release, – went through dozens of examples and comparisons to other models. Some thoughts:
– It does WAY more than what you asked for
– It sometimes generates best in class stuff
– But sometimes crashes out and does -

Gemini 3.5 Flash performance in BullshitBench
By
–
BullshitBench update: Gemini 3.5 Flash did pretty badly – below a bit Gemma 4 even (31b high is 80th)
-
EU AI regulation critique and preemptive harm concerns
By
–
I don't believe pre agreeing anything works – eg EU did AI regulation which they started drafting before ChatGPT, as a result it is pretty much harmful without any of the benefits. Preparing some framework before we know what the world looks like could be harmful too. For
-

GPT Image 2 sees 50% usage growth and 1.5bn weekly images
By
–
GPT Image 2: two weeks after launching the model, usage is up 50% and 1.5bn images generated every week in ChatGPT only.
-
Coding with o3 model to avoid AI slowdown
By
–
If you feel like AI is slowing down, try coding with o3 – best model only a year ago
