AI Dynamics

Global AI News Aggregator

Reasoning Models: Why Listed Prices Don’t Match Actual Costs

// When Cheaper Reasoning Models End Up Costing More // The model you think is cheaper might actually cost you more. New research quantifies exactly how misleading listed API prices are. Across 8 frontier reasoning models and 9 tasks, 21.8% of model-pair comparisons exhibit pricing reversal, where the cheaper-listed model costs more in practice. The magnitude reaches up to 28x. Gemini 3 Flash is listed 78% cheaper than GPT-5.2, yet its actual cost is 22% higher. Claude Opus 4.6 is listed at 2x Gemini 3.1 Pro but actually costs 35% less. The root cause: thinking token heterogeneity. On the same query, one model may use 900% more thinking tokens. Why does it matter? Anyone choosing reasoning models for production needs to benchmark actual costs, not listed prices. Removing thinking token costs reduces ranking reversals by 70%. The authors release code and data for per-task cost auditing. Paper: arxiv.org/abs/2603.23971 Learn to build effective AI agents in our academy: academy.dair.ai/

→ View original post on X — @dair_ai, 2026-03-29 15:07 UTC

Commentaires

Leave a Reply

Your email address will not be published. Required fields are marked *