Reasoning Models Are in Production. The Cost Structure Has Changed Fundamentally.
OpenAI o3, Gemini 2.5 Pro, and DeepSeek R1 are in production, but at $15–60 per million output tokens they cost 8–40x more than standard models. Most organisations have not rebuilt their infrastructure assumptions to account for this.