Search

OpenAI Introduces Flex Processing in API to Help Developers Cut AI Usage Costs

OpenAI says Flex processing will offer lower inference costs in exchange for slower response times.

Advertisement
Highlights
  • Flex processing is currently available in beta for o3 and o4-mini models
  • It can return a resource unavailability error occasionally
  • Flex processing will reduce API inference costs by half
OpenAI Introduces Flex Processing in API to Help Developers Cut AI Usage Costs

OpenAI recommends that developers increase the timeout duration for lengthy prompts

Photo Credit: Unsplash/Solen Feyissa

OpenAI introduced a new service tier for developers on Thursday via its application programming interface (API). Dubbed Flex processing, it reduces the AI usage costs by half for developers, compared to standard pricing. However, the lowered prices come with the consequence of slower response times and occasional resource unavailability. The new API feature is currently available in beta for select reasoning-focused large language models (LLMs). The San Francisco-based AI firm said this service tier can be useful for non-production and non-priority tasks.

OpenAI Adds New Service Tier in API

In its support page, the AI firm detailed this service tier. The Flex processing is currently available in beta for Chat Completions and Responses APIs, and works with the o3 and o4-mini AI models. Developers can set the service tier parameter to Flex in API request to activate the new mode.

One downside of the cheaper API pricing is that the processing time will be significantly higher. OpenAI says developers opting for Flex processing should expect slower response times and occasional resource unavailability. Additionally, users may also face API request timeout issues, in case the prompt is lengthy or the request is complex. As per the AI firm, this mode can be helpful for non-production or low-priority tasks such as model evaluations, data enrichment, or asynchronous workloads.

Notably, OpenAI highlights that developers can avoid timeout errors by increasing the default timeout. By default, these APIs are set to timeout at 10 minutes. However, with Flex processing, lengthy and complex prompts can take longer than that. The company suggests increasing the timeout will reduce the chances of getting a error.

Additionally, Flex processing might sometimes lack resources to handle developers' requests, and instead flag the “429 Resource Unavailable” error code. To manage these scenarios, developers can retry requests with exponential backoff, or switch to the default service tier if timely completion is necessary. OpenAI said it will not charge developers when they receive this error.

Currently, the o3 AI model charges $10 (roughly Rs. 854) per million input tokens and $40 (roughly Rs. 3,418) per million output tokens in the standard mode. The Flex processing brings down the input cost to $5 (roughly Rs. 427) and the output cost to $20 (roughly Rs. 1,709). Similarly, the new service tier will charge $0.55 (roughly Rs. 47) per million input tokens and $2.20 (roughly Rs. 188) per million output tokens for the o4-mini AI model, instead of $1.10 (roughly Rs. 94) for input and $4.40 (roughly Rs. 376) for output in the standard mode.

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

 
Show Full Article
Please wait...
Advertisement

Related Stories

Popular Mobile Brands
  1. iQOO Z10x Review: A Big Battery Budget Smartphone
  2. OnePlus Nord CE 5 Spotted on TDRA Website, Could Launch Soon
  3. SpaceX Launches 23 Starlink Satellites on Falcon 9 Rocket From Cape Canaveral
  4. ChatGPT Search Update Brings Shopping Feature, Multiple Citation Support
  5. Carl Pei on How Nothing and CMF Are Betting Big on India's Youth
  6. iQOO Z10 Turbo Pro, iQOO Z10 Turbo Debut With Sony LYT-600 Main Cameras
  7. iPhone 17 Pro to Arrive Without Anti-Reflective Coating on Display: Report
  8. Mastercard Partners OKX, Nuvei to Launch Payment Ecosystem for Stablecoins
  9. China Uses Gravitational Slingshots to Rescue Two Satellites Stuck in Orbit for 123 Days
  1. China Uses Gravitational Slingshots to Rescue Two Satellites Stuck in Orbit for 123 Days
  2. SpaceX Launches 23 Starlink Satellites on Falcon 9 Rocket From Cape Canaveral
  3. Amazon Launches 27 Satellites to Start Building Project Kuiper Internet Constellation
  4. Coinbase, Animoca Brands Announce Web3 Accelerator Initiative in the UK
  5. Xiaomi 16 Specifications Leaked; Said to Arrive With 6.3-Inch Display and Large Battery
  6. Vivo Y300 GT Launch Date Announced; Design, Key Features Revealed Days Ahead of Debut
  7. Reddit Expands AI-Powered Translation Feature for Posts and Comments in Hindi
  8. Abu Dhabi’s ADQ, FAB and IHC Announce Plans for Dirham-Backed Stablecoin
  9. Duolingo Outlines AI-First Strategy, Plans to Replace Contract Workers With AI Tools
  10. ED Said to be Quizzing Apple, Xiaomi in E-Commerce Probe on Amazon, Flipkart
Gadgets 360 is available in
Download Our Apps
App Store App Store
Available in Hindi
App Store
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.
Trending Products »
Latest Tech News »