OpenAI slashed inference costs for ChatGPT by over 50%. This significant reduction was achieved through optimizations that drastically lowered the number of Nvidia GPUs required for serving guest users. The move highlights OpenAI's aggressive push towards greater efficiency and scalability in their AI operations.
Opening Kapyn…