kapynBig Tech

OpenAI reportedly cut response costs for guest ChatGPT users by more than half

OpenAI slashed inference costs for ChatGPT by over 50%. This significant reduction was achieved through optimizations that drastically lowered the number of Nvidia GPUs required for serving guest users. The move highlights OpenAI's aggressive push towards greater efficiency and scalability in their AI operations.

The Decoder·Jun 30, 2026

Opening Kapyn…