Datagrom AI News Logo

Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations

Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations

January 10, 2025: Google DeepMind Tackles LLM Factuality Challenges - Google DeepMind researchers unveiled the FACTS Grounding benchmark to enhance the factual accuracy of large language models (LLMs) and minimize hallucinations in their outputs. This benchmark assesses LLMs' ability to produce precise, detailed responses from long-form documents.

A FACTS leaderboard on Kaggle ranks models, with Gemini 2.0 Flash currently at the top. Models are evaluated on diverse documents and judged by three LLMs to ensure unbiased, factual results. This initiative seeks to address LLM challenges and improve AI systems through rigorous benchmarking and ongoing development.

Link to article Share on LinkedIn

Get the Edge in AI – Join Thousands Staying Ahead of the Curve

Weekly insights on AI trends, industry breakthroughs, and exclusive analysis from leading experts.

Only valuable weekly AI insights. No spam – unsubscribe anytime.