AI Dynamics

Global AI News Aggregator

About

Evaluating World Model Capabilities Across GPT Versions

The debate over questions like “does GPT have a world model” continues. No real need to argue about it based on individual anecdotes — better to just write an OpenAI Eval, and check the trendline of performance across different GPTs: https://
github.com/openai/evals

→ View original post on X — @gdb