With all the saga around @windsurf_ai over the last 72 hours, it’s worth looking back to 2015, when VS Code was launched and open-sourced. The decision to open source ultimately led to at least 2 startups valued at $12bn+ in total. There are many stories to be told: pivots,
@petergostev
-

Grok 4 Performance Analysis on Independent Benchmarks
By
–
Since the Grok 4 release, several independent benchmarks have emerged. They aren't necessarily general or standard, but it is interesting to see how the official release (#1 everywhere) compares on benchmarks that Grok 4 likely wasn't fully optimised for. The results are quite
-

Grok 4 Token Efficiency Compared to o3 Model
By
–
Grok 4 is really expensive – not just because its price per token is higher (it's priced the same as Sonnet or around 2x o3), but also because it needs to generate about 2x more tokens than o3 to complete the same tasks. We can see this clearly from data provided by
-

Grok 4 Launch Reveals Critical AI Model Development Trends
By
–
The reason the Grok 4 launch is interesting isn't because this model is so great that you should drop everything and use it instead (it isn't). It's interesting because of what it tells us about model trends. What we've learned:
– Grok 4 had no extra pre-train compute compared -
o3 Research AI Model Parallel Verification Cross Check
By
–
o3 is just so sublime on research, I can't get enough of using it. If I need something more serious, I'd throw in parallel requests to o3 in multiple windows and cross check. It is a beast
-

Do Language Models Think Differently in Other Languages?
By
–
Do language models 'think' differently in other languages? The answer seems to be 'no'. When asked 'What's your favourite number?' in 30 different languages, models answered '7' 90% of the time and '42' 9% of the time. This mirrors the results we had with the question 'What's
-

OpenAI Allocates $1.19 in Research Funding Per Dollar Revenue
By
–
For every $1 you spend on OpenAI, you are giving $1.19 to its researches from @theinformation
-

AI Dominates Y Combinator: 85% Companies, Agents Leading Growth
By
–
AI is eating the start-up world. More than 85% of @ycombinator companies are AI companies, up from ~20% before ChatGPT launched. Of the 85%, about a third are AI Agent companies, which is the fastest growing category of start-ups (~doubled in 3 years). @garrytan @paulg
-
Models Still Lack Reliability for Real Agent Development
By
–
Models are just not there to build real agents, whenever we try to build even multi step workflows (I wouldn't call them agents), we get there but it takes way more time and effort than it might seem from initial testing. Models are flaky, they don't understand nuances, it takes
