Datagrom AI News Logo

Meta exec denies the company artificially boosted Llama 4’s benchmark scores

Meta exec denies the company artificially boosted Llama 4’s benchmark scores

April 7, 2025: Meta Refutes Claims of Llama 4 Benchmark Manipulation - Meta's VP of generative AI, Ahmad Al-Dahle, refuted claims that the company manipulated benchmark scores for its Llama 4 Maverick and Scout models by training them on test sets. The rumor surfaced from a Chinese social media post by a purported former employee, alleging Meta concealed the models' weaknesses.

Discrepancies in model performance reports and the use of an experimental Maverick version for benchmarks contributed to the speculation. Al-Dahle admitted to varied user experiences and pledged to resolve these issues as implementations stabilize.

Link to article Share on LinkedIn

Stay Current on AI in Minutes Weekly

Cut through the AI noise - Get only the top stories and insights curated by experts.

One concise email per week. Unsubscribe anytime.