Search

Alibaba Qwen 2.5 Vision Language Model Released in a Smaller Size, Packs Agentic Capabilities

The new AI model is dubbed Qwen 2.5-VL-32B Instruct, and it joins the 3B, 7B and 72B sizes.

Advertisement
Highlights
  • The AI model is available to the open community with Apache 2.0 licence
  • Alibaba says its responses are more aligned with human preferences
  • Qwen-2.5-VL-32B outperforms AI models of comparable size
Alibaba Qwen 2.5 Vision Language Model Released in a Smaller Size, Packs Agentic Capabilities

Besides visual capabilities, the latest Qwen model also comes with improvements in text functions

Photo Credit: Reuters

Alibaba's Qwen team released another artificial intelligence (AI) model to the Qwen 2.5 family on Monday. Dubbed Qwen 2.5-VL-32B Instruct, the AI model comes with improved performance and optimisations. It is a vision language model with 32 billion parameters, and joins the three billion, seven billion, and 72 billion parameter size models in the Qwen 2.5 family. Just like all previous models by the team, it is also an open-source AI model available under a permissive license.

Alibaba Releases Qwen 2.5-VL-32B AI Model

In a blog post, the Qwen team detailed the company's latest vision language model (VLM). It is more capable than the Qwen 2.5 3B and 7B models, and smaller than the foundation 72B model. The large language model's (LLM) older versions outperformed DeepSeek-V3, and the 32B model is said to be outperforming Google and Mistral's similar sized systems.

Coming to its features, the Qwen 2.5-VL-32B-Instruct has an adjusted output style that provides more detailed and better-formatted responses. The researchers claimed that the responses are closely aligned with human preferences. Mathematical reasoning capability has also been improved, and the AI model can solve more complex problems.

The accuracy of image understanding capability and reasoning-focused analysis, including image parsing, content recognition, and visual logic deduction, has also been improved.

qwen25vl benchmark Qwen 2 5 VL 32B Instruct

Qwen 2.5-VL-32B-Instruct
Photo Credit: Qwen

 

Based on internal testing, the Qwen 2.5-VL-32B is claimed to have surpassed the capabilities of comparable models, such as Mistral-Small-3.1-24B and Google's Gemma-3-27B, on the MMMU, MMMU-Pro, and MathVista benchmarks. Interestingly, the LLM was also claimed to have outperformed the much larger Qwen 2-VL-72B model on the MM-MT-Bench.

The Qwen team highlights that the latest model can directly play as a visual agent that can reason and direct tools. It is inherently capable of computer use and phone use. It accepts text, images, and videos with more than one hour of duration as input. It also supports JSON and structured outputs.

The baseline architecture and training remain the same as the older Qwen 2.5 models, however, the researchers implemented a dynamic fps sampling to enable the model to comprehend videos at varying sampling rates. Another enhancement also lets it pinpoint specific moments in a video by gaining an understanding of temporal sequence and speed.

Qwen 2.5-VL-32B-Instruct is available to download on GitHub and its Hugging Face listing. The model comes with Apache 2.0 licence, which allows both academic and commercial usage.

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

 
Show Full Article
Please wait...
Advertisement

Related Stories

Popular Mobile Brands
  1. Insta360 X5 With Replaceable Lens System Launched in India: See Price
  2. Vivo T4 5G With Snapdragon 7s Gen 3 SoC, 7,300mAh Battery Debuts in India
  3. iQOO Z10 Turbo Series Set to Debut on This Date; Chipset, Battery Revealed
  4. Why Google Might Shift Production of Pixel Phones From Vietnam to India
  5. Madhav Sheth Joins Nxtcell to Lead Alcatel's Re-Entry to Indian Market
  6. Instagram to Use AI Tools to Spot Teens Who Falsely Claim to Be Adults
  7. Amazfit Active 2 With Up to 10 Days Battery Life Debuts in India
  8. Vivo Watch 5 With 5ATM Rating, Up to 22 Days Battery Life Launched
  1. Asus Vivobook S14, Vivobook S14 Flip With 13th Gen Intel Core i5 Processors Launched in India
  2. HTech's Madhav Sheth Joins Nxtcell to Lead Launch of Alcatel Smartphones in India; Teases New Honor Products
  3. Moto Tag With Support for Google's Find My Device Network Launched in India: Price, Features
  4. Insta360 X5 With AI-Powered PureVideo Low-Light Mode, Replaceable Lens System Launched in India
  5. USDC-Issuer Circle Plans Payment Network to Process Transactions via Stablecoins
  6. ElevenLabs Unveils Agent Transfer Feature to Share Data Between AI Agents
  7. Realme 14T Surfaces on Google Play Supported Devices List, Bluetooth SIG, Other Certification Websites
  8. Uber Sued by FTC Over ‘Deceptive’ Subscription Sign-Ups
  9. Huawei Enjoy 80 With 6,620mAh Battery, 50-Megapixel Camera Launched: Price, Specifications
  10. Google Could Use AI to Extend Search Monopoly, DOJ Says as Trial Begins
Gadgets 360 is available in
Download Our Apps
App Store App Store
Available in Hindi
App Store
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.
Trending Products »
Latest Tech News »