I=9
Meta launches Llama 4 Scout and Maverick, claims best-in-class performance
- Yesterday, Meta Platforms released two models of its Llama 4 model, the Llama 4 Scout and the Llama 4 Maverick.
- It said the Llama 4 Scout has 17 billion parameters and 16 experts, outperforms previous Llama models, fits on a single NVIDIA H100 GPU, supports a 10 million token context window, and beats Gemma 3, Gemini 2.0 Flash-Lite, and Mistral 3.1 on widely reported benchmarks. It added that it’s the best multimodal model in its class.
- It said Llama 4 Maverick has 17 billion parameters and 128 experts, is the best multimodal model in its class, beats GPT-4o and Gemini 2.0 Flash on reported benchmarks, matches DeepSeek V3 on reasoning and coding with less than half the parameters, and offers a best-in-class performance-to-cost ratio.
- It pointed out that both models were distilled from Llama 4 Behemoth, a 288-billion-parameter model with 16 experts that outperforms GPT-4.5, Claude Sonnet 3.7, and Gemini 2.0 Pro on several STEM benchmarks, though it’s still in development.