Meta Llama 4 Scout and Maverick AI Models With MoE Architecture Released

Meta has also previewed Llama 4 Behemoth, the largest AI model in the family so far, with 288 billion active parameters.

Highlights
  • The Llama 4 Scout is a 17 billion active parameter model with 16 experts
  • The Maverick model has 17 billion active parameters and 128 experts
  • Llama 4 Behemoth is said to outperform GPT-4.5 and Gemini 2.0 Pro

Both Llama 4 Scout and Maverick are available on Hugging Face and the Llama website

Photo Credit: Meta

Meta introduced the first artificial intelligence (AI) models in the Llama 4 family on Saturday. The Menlo Park-based tech giant released two models, Llama 4 Scout and Llama 4 Maverick, with native multimodal capabilities to the open community. The company says these are its first open models built with a Mixture-of-Experts (MoE) architecture. Compared to their predecessors, they offer larger context windows and better power efficiency. Meta also previewed Llama 4 Behemoth, the largest AI model in the family unveiled so far.

Meta Llama 4 AI Models Arrive With MoE Architecture

In a blog post, the tech giant detailed its new AI models. Like the previous Llama models, Llama 4 Scout and Llama 4 Maverick are open models and can be downloaded from Meta's Hugging Face listing or the dedicated Llama website. Starting today, users can also try the Llama 4 AI models in WhatsApp, Messenger, Instagram Direct, and on the Meta AI website.

The Llama 4 Scout is a 17 billion active parameter model with 16 experts, whereas the Maverick model comes with 17 billion active parameters and 128 experts. Scout is said to be able to run on a single Nvidia H100 GPU. Additionally, the company claimed that the previewed Llama 4 Behemoth outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several benchmarks. Meta said the Behemoth model, with 288 billion active parameters and 16 experts, was not released as it is still being trained.
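As a rough illustration of the single-GPU claim, the sketch below estimates how much memory the model weights alone would need at different precisions. Every number in it is an assumption that does not appear in this article: the 109 billion total parameter count for Scout and the 4-bit quantisation come from Meta's announcement, and the 80GB figure assumes the 80GB variant of the H100. The estimate ignores activations and the key-value cache, so it is a lower bound rather than a deployment guide.

```python
def weight_memory_gb(total_params, bits_per_param):
    """Memory needed to hold the weights alone, in gigabytes (10^9 bytes)."""
    return total_params * bits_per_param / 8 / 1e9


# Assumptions, not stated in this article:
SCOUT_TOTAL_PARAMS = 109e9  # total (not active) parameters, per Meta's announcement
H100_MEMORY_GB = 80         # 80GB H100 variant

for bits in (16, 8, 4):
    needed = weight_memory_gb(SCOUT_TOTAL_PARAMS, bits)
    verdict = "fits" if needed < H100_MEMORY_GB else "does not fit"
    print(f"{bits}-bit weights: {needed:.1f} GB -> {verdict} in {H100_MEMORY_GB} GB")
```

Under these assumptions, only the 4-bit case falls below the memory of a single card. Note that all experts must be held in memory even though only 17 billion parameters are active for any given token.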

The MoE architecture in Llama 4 AI models
Photo Credit: Meta

 

Turning to the architecture, the Llama 4 models use an MoE design, in which each token activates only a fraction of the model's total parameters. This makes the models more compute-efficient in both training and inference. In the pre-training phase, Meta also used new techniques such as early fusion, which integrates text and vision tokens into a single model backbone, and MetaP, which sets critical model hyper-parameters and initialisation scales.
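The idea of activating only some experts per token can be sketched in a few lines of Python. This is a generic top-k routing toy, not Meta's implementation: the function names, the random router weights, and the "experts" that simply scale their input are all hypothetical, and the real models use learned neural-network experts.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def route_token(router_logits, top_k=1):
    """Pick the top_k experts for one token; return [(expert_index, weight), ...]."""
    probs = softmax(router_logits)
    chosen = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:top_k]
    norm = sum(probs[i] for i in chosen)  # renormalise over the chosen experts
    return [(i, probs[i] / norm) for i in chosen]


def moe_layer(token, experts, router_weights, top_k=1):
    """Run one token through a toy MoE layer; only the chosen experts execute."""
    logits = [sum(w * x for w, x in zip(row, token)) for row in router_weights]
    routes = route_token(logits, top_k)
    out = [0.0] * len(token)
    for idx, weight in routes:
        expert_out = experts[idx](token)
        out = [o + weight * e for o, e in zip(out, expert_out)]
    return out, [idx for idx, _ in routes]


random.seed(0)
NUM_EXPERTS, DIM = 16, 4  # 16 experts, as in Scout; DIM is arbitrary
# Hypothetical experts: expert i just multiplies the token by (i + 1).
experts = [lambda v, s=i + 1: [s * x for x in v] for i in range(NUM_EXPERTS)]
router_weights = [[random.uniform(-1, 1) for _ in range(DIM)] for _ in range(NUM_EXPERTS)]

token = [0.5, -1.0, 0.25, 2.0]
output, used = moe_layer(token, experts, router_weights, top_k=1)
print(f"experts run for this token: {len(used)} of {NUM_EXPERTS}")
```

With `top_k=1`, only one of the 16 toy experts does any work for the token, which is where the compute saving comes from; the remaining experts still have to be stored.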

For post-training, Meta started with lightweight supervised fine-tuning (SFT), followed by online reinforcement learning (RL) and lightweight direct preference optimisation (DPO). This order was chosen to avoid over-constraining the model. The researchers also removed more than 50 percent of the data tagged as easy and performed SFT only on the remaining “harder” set.
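For readers unfamiliar with DPO, the sketch below computes the standard DPO loss from the research literature for a single preference pair. It is not Meta's training code: the function name, the `beta` value, and the log-probabilities in the usage lines are hypothetical. The loss falls when the model being trained favours the preferred ("chosen") response more strongly than a frozen reference model does.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Standard DPO loss for one (chosen, rejected) pair of responses.

    Each argument is the summed log-probability of a response under either
    the policy being trained or the frozen reference model.
    """
    chosen_margin = policy_chosen_logp - ref_chosen_logp
    rejected_margin = policy_rejected_logp - ref_rejected_logp
    z = beta * (chosen_margin - rejected_margin)
    # -log(sigmoid(z)) written as a numerically stable softplus(-z)
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))


# Hypothetical log-probabilities, for illustration only.
no_preference = dpo_loss(-1.0, -1.0, -1.0, -1.0)
prefers_chosen = dpo_loss(-0.5, -2.0, -1.0, -1.0)
prefers_rejected = dpo_loss(-2.0, -0.5, -1.0, -1.0)
print(prefers_chosen < no_preference < prefers_rejected)
```

When the policy matches the reference model, the loss equals log 2; it drops as the policy shifts probability towards the chosen response.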

Based on internal testing, the company claimed that the Maverick model outperforms Gemini 2.0 Flash, DeepSeek v3.1, and GPT-4o on the MMMU (image reasoning), ChartQA (image understanding), GPQA Diamond (reasoning and knowledge), and MTOB (long context) benchmarks.

Meanwhile, the Scout model is said to outperform Gemma 3, Mistral 3.1, and Gemini 2.0 Flash-Lite on the MMMU, ChartQA, MMLU (reasoning and knowledge), GPQA Diamond, and MTOB benchmarks.

Meta has also taken steps to make the AI models safer during both pre-training and post-training. In pre-training, the researchers applied data filtering to keep harmful content out of the training data. In post-training, they added open-source safety tools such as Llama Guard and Prompt Guard to protect the models from external attacks. The researchers also stress-tested the models internally and opened Llama 4 Scout and Maverick to red-teaming.

Notably, the models are available to the open community under the Llama 4 community licence. It allows both academic and commercial use; however, companies with more than 700 million monthly active users must request a separate licence from Meta before using the models.
