Google Releases Cost-Efficient and Low-Latency Gemini 2.5 Flash AI Model

Google said the Gemini 2.5 Flash model is ideal for responsive virtual assistants and real-time summarisation tools.

Highlights
  • Gemini 2.5 Flash comes with native reasoning capability
  • Google said it will be added to Vertex AI and AI Studio soon
  • Gemini 2.5 Flash can also be used to build AI agents
There is no word on when the AI model will be rolled out to end consumers

Photo Credit: Google

Google unveiled the second artificial intelligence (AI) model in its Gemini 2.5 family on Thursday. Dubbed Gemini 2.5 Flash, it is a cost-efficient, low-latency model designed for real-time inference, conversations at scale, and general-purpose tasks. The Mountain View-based tech giant will soon make the model available on both Google AI Studio and Vertex AI, so that users and developers can access Gemini 2.5 Flash and build applications and agents with it.

Gemini 2.5 Flash Is Coming Soon to Vertex AI

In a blog post, the tech giant detailed its latest large language model (LLM). Alongside announcing the debut of the Flash model, the post also confirmed that the Gemini 2.5 Pro model is now available on Vertex AI. Differentiating between the use cases of the two models, Google said the Pro model is ideal for tasks that require intricate knowledge, multi-step analysis, and nuanced decision-making.

On the other hand, the Flash model prioritises speed, low latency, and cost efficiency. Calling it a workhorse model, the tech giant said it is an “ideal engine for responsive virtual assistants and real-time summarisation tools where efficiency at scale is key.”

While launching the 2.5 Pro model, Google had specified that all LLMs in this series would feature a natively built reasoning or “thinking” capability. This means the 2.5 Flash also comes with “dynamic and controllable reasoning.” Developers can adjust the processing time for a query based on its complexity, giving them granular control over response generation times.
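To illustrate the idea of scaling reasoning effort with query complexity, here is a minimal Python sketch. It does not call any Google API: the function names, the keyword list, and the token-style budget are all hypothetical, since Google's post as described here does not specify how the control is exposed.

```python
def estimate_complexity(prompt: str) -> float:
    """Crude, hypothetical complexity score in [0, 1] based on length and keywords."""
    keywords = ("prove", "analyse", "compare", "step by step", "plan")
    text = prompt.lower()
    # Longer prompts score higher, capped at 1.0 (the 200-word cap is an assumption).
    length_score = min(len(text.split()) / 200, 1.0)
    # Each matching keyword adds a third of the maximum score.
    keyword_score = min(sum(k in text for k in keywords) / 3, 1.0)
    return max(length_score, keyword_score)


def pick_reasoning_budget(prompt: str, max_budget: int = 8000) -> int:
    """Map the complexity score to a hypothetical reasoning budget (in tokens)."""
    return round(estimate_complexity(prompt) * max_budget)


if __name__ == "__main__":
    print(pick_reasoning_budget("hi"))
    print(pick_reasoning_budget(
        "Compare the two plans and prove step by step which is cheaper"
    ))
```

In this sketch, a short greeting gets a small budget while a multi-part analytical request gets the full budget. A real application would likely use a better complexity signal than keyword matching.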

For its enterprise clients, Google is also introducing the Vertex AI Model Optimiser tool. Available as an experimental feature within the platform, it removes the guesswork of choosing a specific model when users are unsure which one fits. The feature automatically generates the best response for each prompt, balancing factors such as quality and cost.
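Google has not described how the Model Optimiser makes its choice, but the general idea of trading quality against cost can be sketched in a few lines of Python. Everything below is an assumption for illustration: the candidate names, the placeholder quality and cost scores, and the weighted scoring rule are not taken from Google's tool.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    name: str
    quality: float  # placeholder score in [0, 1], not a real benchmark
    cost: float     # placeholder relative cost in [0, 1], not real pricing


# Illustrative values only; they are assumptions, not published figures.
CANDIDATES = [
    Candidate("pro-like model", quality=0.9, cost=0.8),
    Candidate("flash-like model", quality=0.7, cost=0.2),
]


def choose_model(quality_weight: float) -> Candidate:
    """Pick the candidate with the best weighted quality-versus-cost score."""
    if not 0.0 <= quality_weight <= 1.0:
        raise ValueError("quality_weight must be between 0 and 1")
    cost_weight = 1.0 - quality_weight
    return max(
        CANDIDATES,
        key=lambda c: quality_weight * c.quality - cost_weight * c.cost,
    )


if __name__ == "__main__":
    print(choose_model(1.0).name)  # quality is all that matters
    print(choose_model(0.5).name)  # quality and cost weighted equally
```

With these placeholder numbers, a caller who cares only about quality is routed to the larger model, while an even weighting favours the cheaper one.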

Google did not release a technical paper or model information card alongside the release, so details about the model's architecture, pre- and post-training processes, and benchmark scores are not known. The company might publish these details later, when it makes the model available to end consumers.

Meanwhile, the tech giant is also adding new tools to support agentic application building on Vertex AI. The company is adding a new Live application programming interface (API) for Gemini models that will allow AI agents to process streaming audio, video, and text with low latency, letting them complete tasks in real time.

The Live API, which is powered by Gemini 2.5 Pro, also supports resumable sessions longer than 30 minutes, multilingual audio output, time-stamped transcripts for analysis, tool integration, and more.
