Opinion on DeepSeek R1, o1, and suspicious o3 benchmarks - AI Dynamics

AI Dynamics

Global AI News Aggregator

Opinion on DeepSeek R1, o1, and suspicious o3 benchmarks

30 May 2026 14h52

My personal, vibes-based opinion on this gap: at the DeepSeek R1 level, I believe it was real, since o1 and R1 were not that far apart. From o3 onwards, I think there's something fishy going on with the benchmarks. Open models are not bad and are certainly getting better, but the utility…

→ View original post on X — @petergostev

AI GENERATIVE AI LLMS MACHINE LEARNING OPEN SOURCE RESEARCH
