Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available

Gemini 1.5 Flash-8B is an experimental version of Gemini 1.5 Flash, first released last month.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 October 2024 14:22 IST
Highlights
  • Google has doubled the rate limits with Gemini 1.5 Flash-8B
  • The AI model costs $0.15 (roughly Rs. 12.5) per 1 million output tokens
  • Gemini 1.5 Flash-8B is said to be optimised for speed and efficiency
Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available

Developers can access Gemini-1.5 Flash-8B for free via Google AI Studio and the Gemini API

Photo Credit: Google

Gemini 1.5 Flash-8B, the latest entrant in the Gemini family of artificial intelligence (AI) models, is now generally available for production use. On Thursday, Google announced the general availability of the model, highlighting that it was a smaller and faster version of the Gemini 1.5 Flash which was introduced at Google I/O. Due to being fast, it has a low latency inference and more efficient output generation. More importantly, the tech giant stated that the Flash-8B AI model is the “lowest cost per intelligence of any Gemini model”.

Gemini 1.5 Flash-8B Now Generally Available

In a developer blog post, the Mountain View-based tech giant detailed the new AI model. The Gemini 1.5 Flash-8B was distilled from the Gemini 1.5 Flash AI model, which was focused on faster processing and more efficient output generation. The company now claims that Google DeepMind developed this even smaller and faster version of the AI model in the last few months.

Despite being a smaller model, the tech giant claims that it “nearly matches” the performance of the 1.5 Flash model across multiple benchmarks. Some of these include chat, transcription, and long context language translation.

One major benefit of the AI model is its price effectiveness. Google said that the Gemini 1.5 Flash-8B will offer the lowest token pricing in the Gemini family. Developers will have to pay $0.15 (roughly Rs. 12.5) per one million output tokens, $0.0375 (roughly Rs. 3) per one million input tokens, and $0.01 (roughly Rs. 0.8) per one million tokens on cached prompts.

Advertisement

Additionally, Google is doubling the rate limits of the 1.5 Flash-8B AI model. Now, developers can send up to 4,000 requests per minute (RPM) while using this model. Explaining the decision, the tech giant stated that the model is suited for simple, high-volume tasks. Developers who wish to try out the model can do so via Google AI Studio and the Gemini API free of charge.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. iPhone 17 Pro, iPhone 17 Pro Max Alleged Geekbench Listing Leaked
  2. Nothing Phone 3 to Be Manufactured in India, Company Reveals Model Number
  3. Poco F7 Spotted on Geekbench With Snapdragon 8s Gen 4, 12GB of RAM
  4. OnePlus Nord 5 Allegedly Spotted on Geekbench With This Chipset
  5. Realme 15 Pro Tipped to Launch in India in These Colour Options
  6. Titan: The OceanGate Disaster Now Streaming on Netflix: What You Need to Know
  1. Hubble Finds Cosmic Dust Coating Uranus’ Moons, Not Radiation Scars
  2. New Theory Challenges Black Hole Singularities, But Critics Raise Red Flags
  3. Solar Orbiter Captures First-Ever Close-Up of Sun’s South Pole, Revealing Magnetic Field Chaos
  4. The Summer I Turned Pretty Season 3 OTT Release Date: When and Where to Watch Final Season Online?
  5. Mokshapatam Hindi OTT Release: Where to Watch it Online?
  6. Titan: The OceanGate Disaster Now Streaming on Netflix: What You Need to Know
  7. Stellar Blade Becomes Sony's Biggest Single-Player Steam Launch Ever a Day After PC Release
  8. Microsoft 365 Copilot Vulnerable to Zero-Click EchoLeak Exploit, Cybersecurity Researchers Say
  9. Samsung Rolls Out One UI 8 Beta 2 Update for Galaxy S25 Series in Select Countries
  10. Amazon Prime Video Now Shows Twice As Much Ads As Before: Report
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.